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the  data  requirements  on  u if  sufficient  information  about  u is  available. 

One  application  we  examine  is  the  effect  of  the  Coriolis  term  in  the 
linearized  shallow  water  equations  on  the  possibility  of  recovering  the  wind 
fields  from  the  geopotential  height. 
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CHAPTER  I 


INTRODUCTION 


Statement  of  the  Problem 

One  major  consideration  in  the  use  of  computers  to  solve  partial 
differential  equations  approximately  in  scientific  applications  is  the 
availability  of  data.  The  equations  governing  many  physical  systems 
have  associated  auxiliary  conditions  which  together  with  the  differential 
equations  give  well-posed  problems--problems  which  have  existence, 
uniqueness,  and  continuous  dependence  of  the  solution  on  the  data. 
Unfortunately,  it  is  often  difficult  to  obtain  the  data  necessary  to 
completely  specify  the  auxiliary  conditions.  Some  physical  quantities 
are  inherently  more  difficult  to  measure  than  others.  Present  systems 
of  observation  may  be  either  incomplete  or  inaccurate,  and  the  necessary 
improvement  of  these  systems  to  provide  adequate  classical  data  may 
be  too  expensive  to  be  practical. 

In  this  thesis  we  study  first  order  hyperbolic  systems  of 
partial  differential  equations  of  the  form 


(1.1) 


k 

Z A, (x, t)u  + B(x, t)u  + f (x, t ) 

J=1  J J 


where 

u = u(x,t)  £ (En  , x € IRk  , 0 < t < tQ 


with  periodic  boundary  conditions  in  x.  The  initial  condition 
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(1.2) 


u(x,0) 


(x) 


and  the  system  (l.l)  form  a well-posed  problem.  We  suppose  that  complete 
initial  data  is  not  available.  More  specifically,  we  assume  that  we  can 
measure  u'*'  = (u^,  ...  , u^)',  perhaps  at  several  time  levels  and  perhaps 
with  its  t derivatives,  but  cannot  measure  u**  = (u„,,,  ...  , u )' 
completely  and  accurately  at  any  time  level.  We  consider  two  questions! 


(1.3)  How  much  information  about  a solution  of  (l.l)  do  we  need 
to  determine  the  solution  uniquely  in  a way  which  depends 
continuously  on  the  data? 

(1.4)  How  do  we  use  this  data  computationally  to  obtain  complete 
initial  data  at  some  time  level? 


We  want  as  much  of  our  data  as  possible  to  be  measurements  of  u1,  which 
we  presume  to  be  available  if  we  need  it,  and  as  little  of  our  data 
as  possible  to  be  measurements  of  u^. 

Several  remarks  on  (1.3)  and  (1.4)  are  in  order.  The  answer  to 
(1.3)  will  certainly  depend  on  the  linkage  between  u1  and  u11  in 
the  differential  equation,  so  appropriate  linkage  conditions  on  system 

i 

(l.l)  will  need  to  be  assumed.  Although  it  would  be  nice  to  include 
the  existence  of  a solution  for  the  data  we  measure  in  (1.3),  we  shall 
see  that  we  often  have  to  overdetermine  the  problem  to  obtain  continuous 
dependence.  The  importance  of  uniqueness  and  continuous  dependence  are 


t S 


iY  i ...  n.  i .i  i .1  . - 


. r ^ • 

■; 


^ *** 
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clear  if  we  plan  to  use  computational  methods.  If  our  data  come  from 
physical  measurements  which  are  sufficiently  accurate  and  the  differential 

L 

equation  is  a good  model  for  the  physical  system,  then  there  will  he  a 
solution  of  the  differential  equation  which  almost  agrees  with  the 
measured  data;  we  assume  this  is  the  case  when  dealing  with  overdetermined 
formulations  of  the  problem.  We  thus  take  the  perspective  in  answering 
(1.5)  that  there  is  a solution  u of  the  differential  equation,  we  know 
certain  theoretically  exact  information  about  u,  and  we  want  to  determine 
u for  all  x and  all  t in  some  interval  0 < t < tQ.  Since  this 
involves  determining  u at  each  fixed  time  level,  since  the  initial- 

? 

value  problem  for  the  system  (l.l)  is  well-posed,  and  since  the  system 
(l.l)  is  reversible  in  time,  it  suffices  to  determine  u at  some  fixed 
time  level,  which  we  take  to  be  t = 0.  In  addition,  constructing  an 
approximation  to  u at  one  time  level  is  sufficient  computationally,  for 
we  can  then  solve  for  u over  the  time  interval  of  interest  by  standard 
difference  methods.  See  Richtmyer  and  Nferton  (1967)  and  Kreiss  and 
Oliger  (1973). 

Application--Numerical  Weather  Prediction 

Problems  of  this  kind  arise  in  geophysical  applications  where 
satellites  are  used  to  collect  data.  One  area  in  which  there  has  been  a 
considerable  amount  of  work  is  global  numerical  weather  prediction.  One 
of  the  simpler  models  investigated,  a barotropic  model,  is  governed  by 
the  shallow-water  equations  (given  here  in  a rotating  Cartesian  coordinate 
system) : 


(1.5) 


u,  + uu  + vu  + cp  - fv  =0 
t x y x 


v.  + uv  + w + cp  + fu  =0 
t x y y 

cp.  +UCD  + vcp  + cp(u  + v ) = 0 
^t  Yy  x y' 


where  u and  v are  the  horizontal  wind  components,  cp  = gZ  is  the 
geopotential  (g  is  the  acceleration  of  gravity  and  Z is  the  height 
of  the  free  surface),  and  f is  the  Coriolis  parameter.  Using  satellites, 
it  is  possible  to  measure  atmospheric  temperature  reasonably  well  over 
the  whole  globe,  but  measuring  the  wind  field  is  more  difficult.  If 
surface  pressure  or  pressure  at  some  reference  height  is  measured,  the 
pressure  field  and  thus  the  geopotential  can  be  determined  from  the 
temperature  field.  In  more  complicated  models  like  the  model  governed 
by  what  meteorologists  term  the  primitive  equations,  the  same  discrepancy 
in  the  availability  of  data  persists:  the  temperature  and  pressure 
fields  are  more  completely  and  accurately  measured  than  the  wind  field. 

See  Oliger  and  Sundstrom  (1976)  for  a discussion  of  several  meteorological 
models. 

The  problems  involved  in  constructing  complete  and  accurate 
initial  data  for  global  weather  prediction  are  more  involved  than  just 
the  lack  of  complete  measurements.  The  task  of  using  all  available 
observations  to  the  best  advantage  in  a numerical  prediction  model  is 
called  data  assimilationj  it  is  often  called  four-dimensional  data 
assimilation  to  emphasize  the  fact  that  observations  are  distributed  in 
time  as  well  as  space.  To  assimilate  data,  two  main  difficulties  must 
be  overcome.  Grid  point  values  have  to  be  approximated  from  the 
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observations,  which  are  irregularly  distributed  in  space  and  time  and 
vary  in  accuracy.  Objective  analysis  is  the  attempt  to  obtain  these 
appr  -xiaations  in  a manner  compatible  with  the  numerical  model.  Once 
a complete  set  of  approximate  data  at  the  grid  -joints  at  some  time  level 
is  assembled — perhaps  with  larger  errors  than  desired  because  of  incomplete 
measurements — there  is  the  numerical  difficulty  of  initialization  shock 
to  be  overcome.  This  shock  involves  the  rapid  growth  of  the  fast-moving 
gravity  waves  which  physically  should  have  small  amplitudes.  The  gravity 
,/aves  drown  out  the  slow-moving  but  physically  significant  Rossby  waves 
by  nonlinear  interaction.  A discussion  of  these  waves  for  a linearized 
version  of  system  (1.5)  is  given  in  Morel,  Lefevre,  and  Rabreau  (1971). 
Initialization  is  the  attempt  to  construct  an  altered,  compatible  set 
of  initial  data  which  does  not  yield  an  initialization  shock  when 
numerically  integrated  forward  in  time.  For  a linear  problem,  initialization 
can  be  thought  of  mathematically  as  the  attempt  to  project  the  given 
initial  data  into  the  subspace  of  initial  data  without  gravity  wave 
components . 

There  are  two  main  approaches  to  initialization.  Static 
initialization  is  the  attempt  to  use  relations  among  the  variables  which 
have  no  time  derivatives  to  derive  a balanced  initial  state,  often  by 
requiring  some  variables  to  be  compatible  with  other  variables.  For 
example,  we  may  want  a wind  field  compatible  with  a given  mass  field. 

Dynamic  initialization  is  the  attempt  to  use  the  properties  of  the 
numerical  prediction  model  to  imitate  the  adjustment  among  the  variables 
which  occurs  in  the  atmosphere.  This  involves  inserting  data  at  different 
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time  levels  into  the  numerical  model  as  the  numerical  integration  proceeds. 
The  hope  is  that  this  will  both  construct  a balanced  state  by  the  adjust- 
ment from  the  model  and  improve  the  accuracy  of  those  variables  which 
are  incompletely  or  inaccurately  measured.  Dissipation  in  the  numerical 
model  is  often  used  to  damp  the  gravity  waves. 

Good  summaries  of  recent  research  in  meteorological  data 
assimilation  are  available  in  Bengtsson  (1975)  and  McPherson  (1975). 

Because  of  the  volume  of  work,  we  cannot  mention  all  the  important  papers 
which  have  contributed  to  the  advances  in  this  area.  The  interested 
reader  is  encouraged  to  check  the  references  in  these  papers  and  in  the 
papers  in  the  following  summary. 

Early  approaches  to  initialization  concentrated  on  the  use  of 
balance  equations — equations  derived  by  setting  the  time  derivative  of 
the  divergence  of  the  velocity  equal  to  zero  in  the  divergence  equation. 

See  Charney  (1955),  Thompson  (1961),  and  Haltiner  (1971)  for  discussions 
of  balance  equations.  It  was  soon  noted  that  this  approach  did  not  pre- 
vent initialization  shock.  The  balance  between  the  wind  field  and  the 
mass  field  (i.e.  the  pressure  field)  required  by  these  equations  is 
only  approximately  valid  in  the  atmosphere.  Miyakoda  and  Moyer  (1968), 
Nitta  and  Hovermale  (1969),  and  Charney,  Halem,  and  Jastrow  (1969) 
proposed  the  procedure  of  dynamic  initialization  to  solve  both  the 
initialization  problem  and  the  problem  of  incomplete  data.  The  method 
used  was  intermittent  updating,  e.g.,  the  replacement  of  the  mass  field 
by  its  correct  values  at  various  times  as  the  numerical  integration 
proceeds  to  construct  the  wind  field.  In  the  following  few  years, 
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many  refinements  to  this  procedure  were  introduced  and  many  numerical 
experiments  were  performed.  See,  for  example,  Miyakoda  and  Talagrand 
(1971),  Talagrand  and  Miyakoda  (1971),  Williamson  and  Kasahara  (1971), 
Williamson  and  Dickinson  (1972),  Mesinger  (1972),  Kasahara  and  Williamson 
(1972),  and  Temperton  (1973).  Some  experiments  integrated  forward  and 
backward  in  time,  updating  the  mass  field  whenever  certain  time  levels 
were  passed  through.  Some  experiments  also  tried  to  construct  the  mass 
field  from  the  wind  field.  Morel,  Lefevre,  and  Rabreau  (1971)  performed 
experiments  using  data  from  the  space-time  manifolds  on  which  satellites 
gather  data. 

Although  some  success  was  obtained  with  dynamical  initialization, 
there  are  many  difficulties  yet  to  be  overcome.  Often  the  errors  in  the 
wind  field  decreased  to  a non-zero  asymptotic  value  when  the  mass  field 
was  updated.  The  optimal  frequency  of  insertion — the  length  of  time 
between  successive  updates  in  intermittent  updating — is  difficult  to 
determine.  Often  convergence  is  very  slow.  There  was  a marked  decrease 
in  the  effectiveness  of  the  method  when  real  data  or  data  with  errors 
was  used  instead  of  the  model-generated  data  used  in  simulation  experi- 
ments. The  wind  field  does  not  appear  to  adjust  to  the  mass  field 
in  the  tropics;  wind  observations  in  the  tropics  will  be  needed.  It 
was  found  to  be  difficult  to. use  data  from  the  space-time  manifolds  on 
which  satellites  gather  data  without  creating  an  initialization  shock 
at  each  time  level  at  which  some  data  is  inserted.  For  this  reason, 
McPherson  (1975)  suggests  that  intermittent  updating  is  preferable  to 
continuous  insertion.  We  also  direct  the  reader  to  the  recent  papers 
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of  Temperton  (1976),  Blumen  (1976),  Hoke  and  Anthes  (1976),  Davies  and 
Turner  (1977),  Blumen  (1977),  Miyakoda,  Strickler,  and  Chludzinski 
(1978)  and  their  references.  Talagrand  has  recently  studied  some  mathe- 
matical aspects  of  four-dimensional  data  assimilation  in  a more  general 
setting  and  has  derived  a general  criterion  for  convergence  of  data 
assimilation.  See  Talagrand  (1977)  and  Talagrand  (1978).  For  informa- 
tion on  some  of  the  current  statistical  techniques  employed  in  objective 
analysis,  see  Schlatter  (1975)  and  Schlatter,  Branstator,  and  Thiel 
( 1976) . 

Because  of  the  difficulties  encountered  in  dynamic  initialization, 
Ghil  (1973)  suggests  returning  to  a static  initialization  procedure. 

His  idea  is  to  derive  an  auxiliary  system  of  differential  equations 
directly  from  the  model  equations  being  used  which  do  not  have  time 
derivatives  of  the  wind  field  in  them.  Presuming  the  mass  field  and 
sufficiently  many  of  its  time  derivatives  are  known,  we  can  conceivably 
solve  these  diagnostic  equations  for  the  wind  field  at  some  time  level, 
yielding  a balanced  initial  state.  He  points  out  that  the  initialization 
shock  encountered  using  the  balance  equations  comes  from  the  fact  that 
they  are  not  compatible  with  the  model  equations,  having  been  derived 
using  an  approximation.  The  systems  Ghil  derives  are  similar  to  the 
balance  equations,  but  are  compatible  with  the  model  equations  since 
they  are  derived  from  them.  He  derives  such  a system  f-,r  the  shallow- 
water  equations  linearized  around  a state  of  rest  in  Ghil  (1973),  and 
for  the  shallow-water  equations  (1.5)  and  the  primitive  equations  in 
Ghil  (1975).  The  diagnostic  equations  for  the  shallow-water  equations 
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are  of  mixed  elliptic-hyperbolic  type}  the  type  of  those  for  the  primitive 
equations  is  harder  to  determine.  In  Ghil,  Shkoller,  and  Yangarber 
(1977),  numerical  experiments  are  performed  on  the  diagnostic  system 
for  the  shallow-water  equations  using  an  iterative  relaxation  scheme. 

The  results  are  reasonably  accurate,  except  when  there  are  large  regions 
of  hyperbolicity.  However,  since  two  time  derivatives  of  the  mass  field 
are  needed,  these  equations  can  only  be  of  practical  use  where  the  measure- 
ments of  the  mass  field  are  sufficiently  accurate. 

The  compatibility  obtained  by  Ghil' s approach  is  a substantial 
improvement  over  that  obtained  by  the  use  of  the  classical  balance 
equations,  but  it  does  not  directly  require  the  gravity  wave  components 
to  be  small,  particularly  if  the  data  used  has  errors.  The  problem  is 
that  the  model  equations  essentially  have  too  many  solutions — Rossby 
waves  and  gravity  waves — and  we  are  only  interested  in  the  slower-moving 
Rossby  waves.  This  problem  may  persist  in  using  initial  data  obtained 
from  diagnostic  equations  compatible  with  the  model  equations  unless  the 
data  used— the  mass  field  with  its  time  derivatives  and  boundary  conditions 
for  the  wind  field — come  from  a solution  of  the  model  equations  with  small 
gravity  wave  components)  this  assumption  is  not  guaranteed  in  view  of 
the  errors  of  observation  and  the  fact  that  the  model  equations  only 
approximate  the  atmospheric  motion  from  which  data  are  observed.  Kreiss 
has  developed  a general  method  using  asymptotic  expansions  to  essentially 
project  a given  set  of  initial  data  into  the  appropriate  subspace;  more- 
over, his  method  extends  to  non-linear  problems.  He  points  out  that 
the  fundamental  difficulty  is  the  existence  of  different  time  scales 
among  the  solutions  of  the  equations,  and  we  are  only  interested  in  the 
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slower  moving  motions.  His  method  modifies  the  initial  data  to  suppress 
the  fast  time  scales.  See  Kreiss  (1977)  and  Kreiss  (1978)  for  details. 

G.  Browning  has  applied  these  techniques  to  meteorological  systems  in 
work  to  appear. 

Ghil’ s work  demonstrates  that  the  wind  field  is  determined  by 
the  mass  field  and  its  time  history  (given  in  the  form  of  two  time 
derivatives)  and  appropriate  boundary  data  for  the  winds,  modulo  the 
theoretical  and  numerical  difficulties  encountered  in  solving  nonlinear 
equations  of  mixed  type.  By  measuring  a certain  nonstandard  set  of 
data,  we  can  construct  complete  initial  data  at  some  time  level.  If 
initialization  shock  still  occurs  because  of  errors  in  the  data,  Kreiss’ 
method  could  be  applied.  The  most  desirable  approach  to  solve  both  the 
incomplete  initial  data  problem  and  the  initialization  shock  problem 
would  be  to  go  directly  from  a sufficient  set  of  nonstandard  data 
which  is  possible  to  measure  to  the  initial  data  projected  into  the 
right  subspace.  This  may  require  less  data  than  to  pass  through  the 
intermediate  step  of  constructing  a complete  set  of  initial  data,  and 
then  modifying  this  initial  data.  The  best  way  to  do  this  is  not  known. 

A more  thorough  understanding  of  the  construction  of  initial  data  from 
nonstandard  data  for  similar  systems  of  equations  would  be  helpful  in 
the  attempt  to  solve  this  difficult  problem,  and  we  address  ourselves 
to  this  construction  in  this  thesis. 

We  have  made  several  choices  in  the  formulation  of  the  problem 
as  stated.  We  deal  with  linear  equations,  although  some  results  extend 
to  certain  nonlinear  equations.  We  consider  periodic  boundary  conditions 
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w 


to  eliminate  complications,  for  programming  convenience,  and  also  since 
the  torus,  like  the  sphere,  is  a compact  manifold  without  boundary. 

We  assume  that  measurements  of  u1  are  uniform  in  space.  We  construct 
the  complete  set  of  initial  data  at  time  t = 0 for  convenience} 
because  of  the  time-reversibility  of  system  (l.l),  we  can  view  our 
measurements  of  u*  for  times  t > 0 as  measurements  of  the  time 
history  of  u1. 


Other  Related  Work 

A somewhat  similar  problem  for  the  heat  equation  and  wave 
equation  is  discussed  in  a paper  by  Greenberg  (1963).  He  introduced 
a method  which  uses  data  at  different  time  levels  on  a course  mesh  to 
construct  initial  data  on  a finer  mesh.  He  constructs  a multi-level 
difference  scheme  on  the  coarse  mesh  which  has  the  property  that  there 
exist  initial  values  on  the  fine  mesh  such  that  if  the  numerical  integra- 
tion were  performed  on  the  fine  mesh,  then  the  computed  solution  would 
agree  with  Greenberg' s scheme  on  the  coarse  mesh.  However,  the  set  of 
initial  data  on  the  fine  mesh  which  is  compatible  with  the  given  data 
on  the  coarse  mesh  is  not  uniquely  determined.  His  method  uses  more 
data  than  necessary  to  achieve  higher  accuracy,  which  is  basically 
the  reason  one  constructs  multi-level  difference  schemes.  See  Kreiss 
and  Oliger  (1973)  for  an  analysis  of  such  schemes. 

Fattorini  and  Radnitz  (1971)  consider  the  problem  of  existence 
and  uniqueness  of  solutions  of  the  n-th  order  Banach  space  valued 
differential  equation 
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(t)  = Au(t) 


t > 0 


satisfying  initial  conditions 

u(k)  (0)  = for  k G a 

where  a is  a subset  of  {0, 1, . . . , n-1) , with  the  condition  that  these 
solutions  must  also  satisfy  an  estimate  of  the  form  ||u(t)||  = 0(eut) 
as  t -»  + oo.  Fattorini  (1973)  considers  the  same  question  on  a finite 
time  interval  without  the  energy  estimate.  Necessary  conditions  on  the 
operator  A are  discussed  in  terms  of  the  spectrum  cr(A)  and  the  growth 
of  the  resolvent  R(a,A)  for  there  to  exist  a (not  necessarily  unique) 
continuous  linear  map  from  the  data  to  the  solution.  In  some  cases, 
it  can  be  concluded  that  A is  bounded. 

In  a more  classical  vein,  there  has  been  interest  in  the  past 
in  the  Dirichlet  problem  for  the  wave  equation.  Bourgin  and  Duff in 
(1939)  proved  that  uniqueness  holds  for  the  Dirichlet  problem  for  the 
equation  u^t  = u^  in  a rectangle  with  sides  parallel  to  the  coordinate 
axes  if  and  only  if  the  ratio  of  the  sides  is  irrational.  More  recently, 
Young  (1971)  and  others  have  extended  this  result  to  more  general 
hyperbolic  equations.  See  the  references  in  Young  (1971)  and  Young 
(1972)  for  details.  These  results,  however,  do  not  include  the  con- 
tinuous dependence  of  the  solution  on  the  data. 


There  has  been  a growing  interest  in  other  related  Improperly 
posed  problems,  e.g.,  inverse  problems.  See  Payne  (1975)  for  a discussion 


of  some  of  these  and  a good  bibliography.  Our  approach  to  the  stated 
problem  Is  to  treat  it  as  much  as  possible  as  a well-posed  problem  by 
obtaining  a priori  estimates  from  which  we  can  conclude  uniqueness  and 
continuous  dependence.  Preliminary  results  of  our  work  were  reported 
in  Bube  and  Oliger  (1977). 


Summary  of  Results 

The  main  results  of  this  thesis  are  given  in  Theorems  4.9,  4.12 
through  4.16,  5-6,  and  5.8.  They  show  that  under  certain  conditions  on 
system  (1.1)  (mainly  involving  the  linkage  between  u1  and  u11), 
measurements  of  either  u and  u^  at  time  t = 0 or  u (t)  for 
0 < t < tp,  combined  with  measurements  of  either  u11  at  t = 0 for 
x restricted  to  a k-1  dimensional  hyperplane  or  a finite  number  of 
Fourier  coefficients  of  u11  at  t = 0,  are  sufficient  to  uniquely 
determine  u1*  at  t = 0 in  a manner  which  depends  continuously  on 
the  measured  data.  These  results  demonstrate  that  it  is  possible  to 
reduce  the  data  requirements  on  u^*  if  sufficient  information  about 
u1  is  available.  The  results  requiring  measurements  of  u*(t)  for 
0 < t < tQ  give  a theoretical  justification  for  the  attempt  to  use 
intermittent  updating  for  hyperbolic  systems  satisfying  the  necessary 
conditions. 


In  Example  4.11,  we  show  that  for  the  shallow-water  equations 
(1.5),  the  geopotential  <p(x,y,0)  is  determined  by  the  winds  u and 
v and  either  u^  and  cp(C,y,0)  or  vt  and  cp(x,0,0).  In 
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Theorem  5-10,  we  show  that  for  the  linearize.d  shallow-water  equations 
with  constant  coefficients,  the  winds  u and  v can  be  determined  from 
the  geopotential  <p(x,y,t)  for  0 < t < tQ  provided  that  the  Coriolis 
parameter  f is  not  zero. 

We  also  examine  the  method  of  intermittent  updating  for  a sample 
two-by-two  equation  of  the  same  form  as  the  linearized  shallow-water 
equations  for  one-dimensional  flow.  The  results  suggest  strongly  that 
the  use  of  different  frequencies  of  updating  is  important  to  avoid 
slow  convergence. 

We  now  outline  the  rest  of  the  thesis.  In  Chapter  II, 
we  introduce  the  notations  and  the  function  spaces  we  will  use,  and 
present  the  necessary  background  results  on  pseudo-differential  operators 
and  hyperbolic  systems.  In  Chapter  III,  we  discuss  question  (1.3) 
for  a sample  two-by-two  equation  to  motivate  the  more  general  results 
by  understanding  what  approaches  work  and  what  approaches  do  not  work 
for  this  sample  equation. 

Chapters  IV  and  V present  answers  to  question  (1.3)  for  more 


general  equations.  We  discuss  approaches  using  time  derivatives  of 
u in  Chapter  IV  and  approaches  which  do  not  use  time  derivatives  of 
u1  in  Chapter  V. 

In  Chapter  VI,  we  discuss  computational  methods  for  the  sample 
equation  of  Chapter  III,  including  an  analysis  of  intermittent 
updating. 
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CHAPTER  II 


PRELIMINARIES 


In  this  chapter,  we  introduce  the  notations  and  the  function 
spaces  that  we  will  use  and  present  the  background  results  on  hyperbolic 
systems  that  we  will  need. 


Notation 

The  transpose  and  conjugate  transpose  of  a vector  or  a matrix 

a will  be  denoted  by  a'  and  a*,  respectively.  (x,y)  = y*x  will 

k k 

denote  the  usual  inner  product  for  vectors  x,  y £ (E  . For  x £ X , 
the  norm  of  x is  |x|  = (x,x)  ' . For  a matrix  A,  the  norm  is 


given  by 


I A | = sup  | Ax | 

M=i 


We  will  also  use  |x|  = sup 


!<  j<k 


lAL  = ,svp  iAxl 

X =1 


If  Q is  an  open  set  in  IR^  , L2(n),  C°°(0),  o(si),  and  £)'  (ft)  will 
denote  the  usual  spaces  with  their  usual  topologies,  as  in  Rudin 
(1973)  or  Yosida  ( 197*0;  we  will  also  use  these  to  denote  spaces  of 
(En-valued  functions,  each  of  whose  components  belongs  to  the  appropriate 


h ‘t*i  s-V 


space.  For  (En-valued  functions,  the  inner  product  in  L2(f2)  is 
given  by 

(u,v)  = / v*(x)  u(x)dx 

n 

If  cp  6 X,  a locally  convex  topological  vector  space,  and 
u € X' , the  dual  space  of  X,  then  we  defined 

(2.1)  (cp,u>  = u(cp) 

We  will  use  this  notation  in  particular  when  X = o(fl). 


Periodic  Distributions 

We  will  be  considering  distribution  solutions  of  the  system  (l. 1). 
The  concept  of  periodic  boundary  conditions  for  functions  can  be  extended 
to  distributions  in  two  natural  ways.  Fortunately,  there  is  a natural 
identification  between  these  two  extensions.  We  refer  to  Rudin  (1975), 
Chapter  7,  Exercise  22. 

If  y € TR  , the  translation  operator  t on  functions  cp 

y 

defined  on  ]R  is  given  by 


(t  <P)U)  = cp(x-y) 


If  T^cp  = cp  for  each  | £ Z , we  say  that  cp  is  periodic. 
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denotes  the  torus  E^  /Z^.  d(T^)  is  the  subspace  of  all 

oo,  k k 

(p£C  (E  ) which  are  periodic.  Convergence  in  d(T  ) is  uniform 

convergence  of  the  function  and  all  its  derivatives,  o' (T  ) is  the 
K 2 ✓ k 

dual  space  of  d(T  ).  L (T  ) denotes  the  Hilbert  space  of  all  periodic, 
2 k 

locally  L functions  defined  on  E , with  the  inner  product 


(2.2) 


(u,v)  = / u(x)  v(x)  dx 

*k 


where  is  the  open  unit  cube  (0,1)  in  E . Functions  u £ L2(Tk) 

k k 

can  be  viewed  as  elements  of  d'(T  );  for  cp  £ d(T  ),  define 


(cp,u)  = / T)(x)  u(x)dx 

A distribution  v £ o' (E^  ) is  said  to  be  periodic  if  for 
each  iji  £ d(E^  ) and  £ £ Zk, 


(t^\|i,v)  = (\|(,v) 


Dj^(E  ) denotes  the  subspace  of  all  v £ o'  (E*  ) which  are  periodic. 

The  identification  between  elements  of  d' (T  ) and  elements  of 


D^(E  ) is  as  follows.  Let  denote  a function  in  d(®  ) such 

that  0 < < 1 and 


Define 


E T 5 = l 
gez  5 i 


?k(x)  = n Ct(xJ 
K .i=l  1 J 


for  X £ E . 
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Then 


(2.3) 


1 


Define  the  continuous  linear  operators  G;£>(Tk)  -*£>(]Rk)  and 
k k 

J:d(®  ) -»  d(T  ) by  Gp  = ^9  and  J\|/  = The  dual  operators 

G':D' (1R  ) -»  o' (T  ) and  J'  :jd' (Tk)  -»jr)'(]Rk)  are  given  by  G'v  = v°G 

and  J'u  = uoj.  The  range  of  J'  is  contained  in  Dl(Bk).  If  G; 

denotes  the  restriction  of  G'  to  D^(]Rk),  then  G^  and  J'  are 

inverse  mappings,  yielding  the  desired  identification.  Note  that  Gi 

# 

is  independent  of  the  choice  of  £ , depending  only  on  (2.3).  By  this 
identification,  we  can  view  elements  of  o' (Tk)  as  periodic  distributions. 

If  9 € 1/(0^)  or  L2(Tk),  the  Fourier  coefficients  of  9 
are  defined  by 


(2*M  $(!)  = / e 27ri(x>^)  9(x)  dx  for  [ 6 Ik 

Sc 

and  the  Fourier  series  of  9 is 

(2.5)  Zk<J>(s) 

|6Zk 

For  u € o'(Tk),  define 

(2.6)  u(|)  = <e“2n’i^x'5^u>  for  % £ Zk 
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If  v € we  define  9(|)  = u(|)  where  u = G'v.  This  is 

consistent  with  (2.h)  if  v is  in  L2(Tk).  If  u G D*  (*)  and 
cp  G D(Tk),  then 

(2.7)  <<P,u)  = E k $(|)  u(-|) 

|GZ 

If  u,  v G L2(Tk),  then 

(2.8)  (u,v)  = E k u($)  v(|) 

|€Z 

For  any  real  number  s,  define 

(2.9)  lluf  = Z (1  + |||2)S  |u(?)|2 

8 |GZk 

The  Sobolev  space  Hs(Tk)  is  the  space  of  all  u G d' (Tk)  for  which 

||u||  < ®.  The  topology  of  d(T  ) is  the  same  as  the  topology  induced 

by  the  seminorms  |M|  for  s = 1,2,3,...  . From  this,  we  see  that 

s 

D' (Tk)  = U HS 

-oo  < s < oo 

Every  g defined  on  Z such  that 

1.  (1+  III2)8  U(OI2<» 

|€Zr 

for  some  s £ ]R  is  the  Fourier  transform  of  a u £ D' (T  ) given  by 

(<p,u)  = Z . <p(s ) g(-|) 

?ez 
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We  can  extend  the  L2(T^)  inner  product  to  distributions  u € H ^(Tk) 

s?  k 

and  v € H (T  ) using  (2.8),  provided  that  s^  + s^  > Oj  under  the 
same  assumptions,  we  also  define 

(2.10)  (u,v)  = £ . u(|)  v(-6) 

|£Z 

For  an  integer  s > 0,  Hs(Tk)  is  the  set  of  all  distributions 
in  D* (T  ) whose  distribution  derivatives  up  to  order  s are  all  in 
L2^).  See  Agmon  (19^5)  for  a discussion  of  Lc  derivatives.  The 
importance  of  Sobolev  spaces  stems  from  the  following  result. 

2.1.  Sobolev’ s Lemma.  If  u £ HS(Tk)  and  s > (k/2)  + m for  some 

integer  m > 0,  then  there  is  a function  u^  £ Cm(Tk)  such  that 

u = u^  a.e.  The  induced  inclusion  map  from  HS(Tk)  into  Cm(Tk)  is 
continuous. 

We  now  consider  functions  and  distributions  depending  on  the 
time  variable  t as  well  as  the  space  variables  x € ]R  . Let 
= JRkx  (0,1)  and  £.  = Tk  x (0,1).  d(E.)  denotes  the  space  of 

all  <p  £ (f° ( :^)  which  are  periodic  in  x and  whose  support  in  t 

is  a compact  subset  of  (0,1),  topologized  as  the  inductive  limit  of 
the  subspaces  of  C°°(  f^)  whose  elements  are  periodic  in  x and  have 
fixed  compact  support  in  t.  This  is  similar  to  the  usual  construction 
of  the  topology  on  d(o)  as  in  Yosida  (19?4).  d' (£k)  is  the  dual 
space  of  o(£k).  D 
which  are  periodic  in  x.  As  before,  there  is  a natural  identification 
between  elements  of  o'  (E^)  an(i  elements  of  D^(^). 


jj(  denotes  the  subspace  of  all  v £ D'  ( f^) 


From  here  on,  we  will  overlook  the  technical  distinction  between 
elements  of  £>' (T  ) and  o'(3R  ) and  that  between  elements  of  d’ (Z.) 
and  d'  ( . 

The  distributions  in  e'(Zk)  that  we  will  be  interested  in  can 
be  considered  as  continuous  (or  at  least  integrable)  functions  from 
[0,1]  into  HS(Tk)  for  some  s.  C([0,1],  Hs(Tk))  is  the  Banach  space 
of  all  continuous  Hs(Tk)-valued  functions  u(t)  defined  on  [0,1]  with 
norm 

(2.11)  ||u||  = sup  ||u(t)  || 

0^t<l  s 

P g 

L ((0>1)>  H (T  ) ) is  the  space  of  all  Bochner  square-integrable 
s / k 

H (T  ) -valued  functions  u(t)  defined  on  (0, 1)  with  norm 

(2.12)  ||u||  2 = (f1  ||u(t)||^  dt)l/2 

S>d  0 3 

See  Yosida  (197*0  for  a presentation  of  the  Bochner  integral.  If 
p s k 

u 6 L ((0, 1),  H (T  ^),  we  can  view  u as  a distribution  in  e ' (Z.)  or 

K 

k+1 

in  d' (T  ) defined  by 

1 

(2.13)  (cp,u)  = / (cp(t),u(t))  dt 

0 

k+1 

for  cp(x,t)  € o(T  ) where  <p(t)  denotes  the  function  of  x in 
d(T  ) obtained  by  holding  t fixed.  We  will  say  that  a distribution 
V in  D'(Ek)  or  in  d' (Tk+1)  is  in  L2((0,l),  HS(Tk))  if  there  is 

O i_ 

a u € L‘ ((0,1),  H (T  ) ) for  which  (2.13)  yields  the  same  distribution. 

Similar  statements  hold  for  C([0,1],  Hs(Tk) ) . 
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Pseudo-differential  operators  on  a torus 


The  theory  of  pseudo-differential  operators  is  a powerful  tool 

for  handling  linear  partial  differential  equations  with  variable 

coefficients.  We  present  here  a theory  of  pseudo-differential  operators 

based  on  Fourier  series  instead  of  Fourier  transforms.  The  constructions 

are  almost  identical,  and  we  refer  to  Taylor  ( 197*0  for  details.  We 

00 

define  symbols  to  be  C in  the  dual  variable  £ as  in  the  transform 
case  to  carry  over  the  results  depending  on  asymptotic  expansions. 

The  main  difference  between  operators  based  on  series  and  those  based 
on  transforms  involves  the  discrete  nature  of  the  dual  variables  in  the 
series  case.  When  new  symbols  are  constructed  in  the  theory,  we  have 
to  manipulate  them  into  a form  which  defines  them  for  all  5 c 1 ; 
this  is  necessary  since  exp(27ri(x,  | ) ) is  periodic  only  for  | £ Z . 
However,  in  dealing  with  symbols  which  are  periodic  in  x,  we  never 
have  to  worry  about  compact  support,  so  many  of  the  technicalities  of 
the  theory  are  simplified. 


2.2.  Definitions.  A multi-index  is  an  element  a of  Z with  non- 
negative components.  |a|  denotes  ay  and  denotes 

It  OL 

IIj=1  (aj’)>  denotes  the  partial  differential  operator 

Jal 


. “l  A ^ 

ox^  • • • ox^ 


a. 


lr  a 

For  5 € I , I denotes  ' We  de^dne 


s' a 


(2ri) 


a 
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For  u € d' (T  ),  we  have 

(2.14)  D°u(|)  = |a  (1(0  for  5 € Zk  . 

2.3.  Definition.  Let  mO.  Sm(Tk)  is  the  set  of  p(x,|)  G C°°(TkxIk) 
(the  set  of  functions  <p(x,|)  € C°°(lRk  x 3Rk)  which  are  periodic  in  x) 
such  that  for  each  pair  of  multi-indices  a and  there  is  a constant 

Ir 

o such  that  for  all  f € IR 

a,  p 

(2.15)  sup|D^o"  p(x,|)  I <Ca>p(l+ 

This  corresponds  to  the  case  p = 1 and  5=0  in  Taylor's  notation. 
p(x,|)  is  called  a symbol.  We  also  allow  matrix  symbols. 


2.4.  Definition.  If  p(x,|)  G Sm,  define  the  operator  P = p(x,D) 
on  o(Tk)  by 

(2.16)  (Pu)(x)  = E e2T,1(x>? ) p(x>|)  Q(|) 

|Glk 

p(x,£)  is  called  the  symbol  of  the  operator  P and  is  often  denoted 
Op.  m Is  called  the  order  of  the  symbol  or  the  order  of  the  operator. 
P is  called  a pseudo-differential  operator.  We  write  P G PSm. 
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•5-  Theorem.  If  p(x,£)  r S , then  P = p(x,D)  is  a continuous 
linear  operator  from  o(Tk)  into  itself.  P can  be  extended  to  a 
continuous  linear  operator  from  (Tk)  into  itself,  using  either  the 
strong  dual  topology  or  the  weak*  topology  of  D'(Tk). 


i\6.  Theorem.  Let  (iik)  be  a decreasing  sequence  of  real  numbers 

m 

with  m.  oo.  Suppose  p.(x,0  € S J.  Then  there  exists  a 
m J 

p(x,5)  € S such  that 


(2.17) 


P - E p,  € S v for  v = 1,2, 
j < v J 


When  (2.17)  holds,  we  write 


p ~E 


and  we  say  that  p is  an  asymptotic  sum  of  the  p 's. 

J 


2.7.  Theorem.  The  product  of  two  pseudo-differential  operators  is 

m. 

again  a pseudo-differential  operator.  If  r(x,|)  € S 1 and 

, , ra2  +mo 

q(x,|)  € S , then  P = q(x,D)  r(x,D)  € PS  1 2 and 


(2.18) 


<Tp(x,T!)  = E f e27Ti(x-V’^  q(x,g  + q)  r(y,q)  dy 


?ezk  Tk 


In  addition, 


(2.19)  orp(x,!)  ~ £ jjr  q(x,|)  d“  r(x,|) 

a>0  • 1 x 

nL+m  -|a| 

The  term  corresponding  to  a in  this  expansion  is  in  S 


2.8.  Definitions.  If  A:o(Tk)  -> o(Tk)  is  a continuous  linear 

k lr 

operator,  then  the  dual  operator  A' :e’ (T  ) ->  D* (T  ) is  defined  by 

(2.20)  (u,  A'v)  = (Au,  v) 

The  adjoint  operator  A*:d' (Tk)  -» o' (Tk)  is  defined  by 

(2.21)  (u,  A*v)  = (Au,  v) 

This  defines  A*v  as  an  element  of  o' (T  ) since 

(2.22)  (cp,\|0  = (cp,\|r> 

for  cp  £ o(Tk)  and  \|r  £ o' (Tk).  If  A*  = A,  we  say  that  A is 
self-adjoint. 


P5 


2.9.  Theorem.  The  dual  operator  and  the  adjoint  operator  of  a pseudo- 
differential operator  are  pseudo-differential  operators.  If  p(x, |)  £ Sm 
and  P = p(x,  D),  then  P'  and  P*  are  in  PSm  and 


(2.23) 


>,  (x,|)  = Z , / e""i^x"y’r1^  p(y,  -l-T))*  dy 


n £ Z Tk 


(2.2h) 


ap*(x,|)  = Z k fk  e^in(x~y’’‘)  p(y,|  +tj)*  dy 
T)  C.  Z T 


In  addition, 


(2.25) 


„(x,s)  - 51  4 a»P(«,  -*)■) 


a!  | x 


(2.26) 


Op*(x,|)  ~ Z rr  ^?d“p(x,|)* 
r a>0  * 5 


The  term  corresponding  to  a in  each  of  these  expansions  is  in 
qm-M 


mi  ®o 

2.10.  Corollary.  If  r(x,£)  2 S and  q(x,|)  € S , then 

m +m  -1 

oftR(x,|)  - q(x,|)  r(x,|)  £ S * . If  the  symbols  r(x,|)  and 

H nu+m  -1 

q(x,|)  commute,  then  [Q,R]  £ PS  . where  [Q, R]  = QR  - RQ 

is  the  commutator  of  Q and  R.  If  p(x,£)  £ Sm,  then 


n (x,5)  - p(x, |)*  £ S . If  p(x,  |)  is  a Hermitian  matrix  for 


all  x and  | , then  P*  - P £ PS  ” . 
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2.11.  Theorem.  If  p(x, |)  € Sm  and  s 6 IR , then  p(x,D)  is 

g S “IH  It 

a continuous  linear  operator  from  H (T  ) into  H (T  ) . 

2.12.  Definitions.  If  p is  an  n x n matrix,  define  Re  p = (p  + P*)/2. 
If  (pv,  v)  > c(v,v)  for  all  v € (En,  we  write  p > cl  or  just  p > c. 

If  P is  a continuous  linear  operator  on  fi(T  ) which  extends  to  a 

If 

continuous  Linear  operator  on  d'(T  ),  define  Re  P = (P  + P*)/2. 


2.13.  Theorem  (Gardlng's  Inequality).  If  p(x, |)  is  an  n x n matrix 

symbol  in  and  Re  p(x,| ) > Cq  > 0,  then  for  any  s < 0 and 

e > 0,  there  exists  a constant  c such  that 

S,€ 


(2.27) 


Re(Pu,u)  > (c  - c ) 


i 


for  all  u € L2(T^)  = H^(Tk).  See  Theorem  7.3  in  Strikwerda  (1976) 
for  a proof  allowing  matrix  symbols. 


2.1b.  Definition.  Let  a(5  ) * (l+  |t|2)  . For  s € E , let 

s s s 

A (f)  = (a(|))  . We  will  use  A to  denote  both  the  symbol  and  the 
operator  obtained  from  this  symbol  as  in  (2.16). 

AS  is  an  isometry  of  Hr(Tk)  onto  Hr“S(Tk).  If  u€Hs(Tk), 

then 


I 


(2.28) 


K - IIasu||_  - (a3u,  a'u)1/2 


The  operator  AS  Is  self-adjoint,  and  a^2  . a*1**2 


First  order  hyperbolic  systems 

We  consider  equations  of  the  form 


(1.1) 


ut  ~ ^ A (x,t)u  + B(x, t)u  + f(x.t) 
j*i  J xi 


»e  assume  here,  without  loss  of  generality,  that  the  time  interval 

»e  are  Interested  in  is  (0,1],  We  adopt  the  following  conventions. 


x e b 

k 

t 

n 

t 


denotes  the  space  variables  x , x 

1 1 k 

is  the  number  of  space  dimensions 
is  the  time  variable 
is  the  number  of  components  in  u 
is  the  number  of  measurable  components. 


For  yeV>,  y1  denotes  (y^.  ...  , y<).  and  y11  denotes 

^yf+l'  "■  ’ I'n^  • Aj(x,t)  and  B(x,t)  are  real-valued  n xn 
matrix  functions,  periodic  in  x and  c"  in  x and  t for  x € ]Rk 

and  t in  a neighborhood  of  l 0, 1 J . We  assume  that  f € L2( (0, 1) ,HS(T 

some  s € IR  . (1.1)  is  to  be  understood  as  an  equality  of  distri- 

butions. Define 
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(2.29) 


i (x,t,|)  = 2vi  Z A (x,t)| 
j=l  3 3 

£0(x,t,i)  = B(x,t) 


Viewing  t as  a parameter,  (l.l)  becomes 


(2.30) 


ut  = + Lo^^u  + f 


2.15.  Definition.  A symmetrizer  for  £ (x,t,|)  is  a smooth  one- 
parameter  family  of  n x n matrix  symbols  r(x,t,|)  £ S°(Tlt)  depending 
on  the  parameter  t £ [0,1]  such  that  r(x,t,|)  is  Hermitian  and 
homogeneous  of  degree  0 in  | for  |||  >1, 


(2.31) 


r(x,t,| ) > c > 0 for  |? | > 1 


for  some  constant  c,  and 


(2.32) 


r(x,t,| ) £1(x,t,|)  + iL(x,t,|)*  r(x,t,|)  = 0 


2.16.  Definitions.  System  (l.l)  will  be  called  symmetrizable  hyperbolic 
or  s-hyperbolic  if  i1(x,t,|)  given  by  (2.29)  has  a symmetrizer. 

System  (l.l)  will  be  called  symmetric  hyperbolic  if  the  matrices 
Aj (x, t)  are  symmetric.  System  (l.l)  will  be  called  strictly  hyperbolic 
if  the  eigenvalues  of  £1(x,t,|)  are  purely  imaginary  and  distinct 
for  |^0. 


-/•  r- 


Clearly  symmetric  hyperbolic  systems  are  s-hyperbolic.  Strictly 
hyperbolic  systems  are  also  s-hyperbolic;  see  Taylor  (1974)  for  a 
construction  of  a symmetrizer.  We  can  also  show  that  system  (l.l)  is 
s-hyperbolic  if  ^(x.t,!)  is  diagonalizable,  has  purely  imaginary 
eigenvalues,  and  has  eigenvectors  which  are  C°°  in  x,  t,  £ except 
where  | = 0;  if  p(x,t,|)  denotes  the  matrix  whose  columns  are  these 
eigenvectors  for  |||  =1,  extended  to  be  homogeneous  of  degree  0 in 
|,  and  <p(|)  is  a C°°  cut-off  function  which  is  0 near  | = 0 and  1 
fbr  |4|  >1,  then  r(x,t,|)  =q>U)  (p(x,  t,  | )-1)*  p(x,  t,  | )_1  is  a 
symmetrizer. 

2.17.  Theorem.  If  system  (l.l)  is  s-hyperbolic,  then  the  initial 
value  problem  for  the  system  is  well  posed:  given  uQ  € H (T  ) and 
f € l/~((0,l),  HS(Tk)),  there  exists  a unique  solution 

u€  C([0,1],  HS(T^))  of  system  (l.l)  satisfying  u(0)  = uQ;  in  addition, 
there  is  a constant  K independent  of  u^  and  f such  that 

(2.33)  Ml,,.  < «lk0lls  + llflls>s) 

The  proof  involves  deriving  an  energy  inequality  using  the 
symmetrizer  and  then  using  this  inequality  with  some  functional 
analysis  to  obtain  the  result.  See  the  sections  on  symmetric  hyperbolic 
systems  and  strictly  hyperbolic  equations  in  Taylor  (1974). 


4-  - - — 

r -t 


30 


■ 


i 


Vt'.i. 


I 

w 


* 


Returning  to  our  discussion  of  the  questions  (1.3)  and  (1.4) 
in  Chapter  I,  we  recall  that  we  take  the  perspective  that  there  is  a 
solution  u of  the  system  (l.l),  we  know  certain  information  about  u, 
and  we  want  to  determine  u.  We  have  chosen  to  pursue  the  intermediate 
problem  of  constructing  initial  data  at  time  t = 0.  A natural  question 
arises:  does  it  make  sense  to  discuss  the  initial  data  of  a distribution 
solution  u€  c' (l^)  of  system  (l.l)?  The  answer  is  affirmative,  as 
Theorem  2.20  will  show.  The  proof  of  Theorem  2.20  is  similar  to  the 
methods  which  Taylor  (1974)  uses  to  handle  elliptic  boundary  value 
problems.  We  will  need  some  auxiliary  spaces  of  distributions  in  the 
proof. 


2.18.  Definition.  For  real  numbers  s and  m,  define 


(2.34) 


"“"<=.»>  ' ,EV 

T c Z 


zv  U + Ul2)s  d + N'T  lact.x) 


2xm 


where  | is  the  variable  dual  to  x and  t is  the  variable  dual  to  t. 
HS,m(Tk+'*')  is  the  space  of  all  u € D'(Tk+1)  for  which  . ||u||^g  m)  < 00  • 
Note  that  these  norms  are  different  from  ||u||  and  ||u||  _ as 

S ,°°  Syd 

defined  in  (2.11)  and  (2.12). 

If  s < m < 0,  then 


(1  + Isl  )8  (1  + hi  )”  < (i  + III2)"  (1  + hi2)' 

< a + lil2  * hi2)” 
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•y 


* 


so 


■r 


TT 


so  H®(Tk+1)  C HS'm(Tk+1).  Also,  if  u€HS'm(Tk+1)  and  <p  6 fl(Tk+1), 
then  cpu  € Hs,m(Tk+1). 


2.19. 

Lemma. 

(i) 

If  u G H£ 

i.m^k+l 

(ii) 

l2((o,i), 

HS(Tk) ) 

(iii) 

Hs>  ^■(Tk+1) 

c C([0. 

Proof. 

= Z i 

|2>  s 


i2xm 


|GZ 

TtZ 


+ E (i  + |||2)S  (i  + |i|2)m  |t|2  |u(|,  r) |‘ 

|GZk 


tGZ 


llullf  . - IluJ'2 

« J 


(2tt)2  * s'm 


and  (i)  follows. 

Suppose 
k+1 

in  d'(T  ).  Define 


Suppose  u € L ((0, l),Hs(Tk)).  We  view  u as  a distribution 


(2.35) 


u(?,t)  - „(»))  . 


Then 


||u(t) J|j  = Z (1  + |||2)s  |u(|,t)|2  . 


|GZ 


Hence 
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y . /.*, 


- a ' ' T*  **V  . lir  fj 


Z (1  + |e|2)S  / |u(|,t)|2dt  = f ||u(t)||2  dt  < °° 

|€Zk  0 0 


so  u(|,t)  € n (0, 1)  for  each  g.  Now 


a(i,x)  . (e-2n«*.5)+tT)j  u j 

■ ! u(t))  dt 


r -27rf.tT  /.  .» 

= / e u(|,t)  dt 


so  Q(|jT)  is  the  r-th  Fourier  coefficient  of  u(g,t).  By  Parseval's 


relation 


/ |u(i,t) | dt  = Z |u(g,r) | , 

0 tGZ 


|u||"  = Zk(l+|||  )S  Z |G(g,x)  |2  < °°,  proving  (ii), 

3,0  |GZK  tGZ 


e "1  lr4-  "1 

Suppose  v € H ’ (T  ) . Then 


Z (1  + |||  )S  Z (1  + | t | ; |^(|, t)|2  < « 

|GZk  tGZ 


So  for  each  |,  the  function 


g(|,t)  = Z e27ritT 
tGZ 


is  in  HL(TL)  and 


E (i  + |||2)S  l|g(s,-)ll?<-. 

k 1 

|GZ 
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By  Sobolev's  Lemma,  g(|,t)  £ C[0, 1]  for  each  |,  and 


Hs(s»*)llo,  = SUP  |g(?>t)|  < c||g(|,*)IL 

0 < t < 1 1 

for  some  constant  c.  Hence 

(2.36)  Z d+  III2)8  llgU,-)£<» 

|£Zk 

For  each  t £ [0,1],  define  u(t)  € o' (Tk)  by  setting  the  |-th 
Fourier  coefficient  of  u(t)  to  be  g(|,t).  By  (2.36),  u(t)  £ Hs(Tk) 
for  each  t.  Since  each  g(|,t)  is  continuous,  the  dominated  convergence 
theorem  and  (2.36)  imply 

||u(t)  - u(t0)||^  = £ (1  + |||2)s  |g(|,t)  - g(s,tQ)|2  -0 

|£Zk 

as  t -»  6^.  So  u£  C([0,1],  Hs(Tk)).  If  u(|,t)  is  defined  by 
(2.35),  then  u(|,t)  = g(|,t)  by  construction.  So  u(|,t)  is  the  r-th 
Fourier  coefficient  of  u(|,t)  = g(|,t),  which  is  tf(|,t).  Hence 
u = v,  proving  (iii). 

2.20.  Theorem.  If  u £ D'(£k)  is  a distribution  solution  of  system  (l.l) 
and  this  system  is  s-hyperbolic,  then  u £ C([0,1],  H°(Tk) ) for  some 
a € ]R  . 


Proof.  Suppose  u£  ( f^)  is  a distribution  solution  of  (1.1). 

Given  e > 0,  choose  a cpQ ( t ) € d(0,1)  such  that  0 < cpQ  < 1, 

cpQ(t)  = 0 near  t = 0 and  t = 1,  and  cpQ(t ) = 1 for  e/2  < t < 1 - e/2. 

k+1 

Extend  the  distribution  cpQu  to  be  periodic  in  tj  then  cpQu  £ ), 

k+1 

so  we  can  view  cp^u  as  an  element  of  d'(T  ).  Now 

(2.37)  (<PQu)t  = + L0(t))(<P0u)  + cpQf  + cpQ  u 

i t 

since  cpQ  is  independent  of  x.  By  assumption,  f € L2((0,1)  Hs(Tk)) 

s 0 k+1 

for  some  s,  so  cp^f  E H ’ (l  ) by  Lemma  2.19  (ii).  Since  cpQu 

and  cpQ  u are  in  d'CT^1),  they  are  in  some  Sobolev  space  Hr(Tk+1). 
t 

We  assume  without  loss  of  generality  that  r € Z,  r < 0,  and  r < s. 

Then  cpQf  £ Hr,0(Tk+'L) , and  cpQu  and  cpQ  u are  in  Hr> r(Tk+1).  Hence 

by  (2.110  (LL(t)  + L0(t))(q>0u)  € HP’1'  r(Tk+1),  so  by  (2.37), 

(<P0u)t  6 Hr'1,r(Tk+1).  By  Lemma  2.19  (0,  cpQu  € Hr~1’ r+1(Tk+1) . 

Choose  cp^,  ...  , cp  r € d(0,  1)  such  that  0 < cp^  < 1,  the 

support  of  cpj  is  contained  in  the  set  where  <p  ^ s 1,  and 

cp  = <p_r  si  in  a neighborhood  of  [e,  1-e].  We  can  view  cp^u  as 
k+1 

an  element  of  d' (T  ),  and 


(2.38) 


((PJu)t  * (Lx(t)  + LQ(t))(cPjU)  + cpjf  + qpj  u 


Also,  cpjU  = cp^cp^u)  and  cp^  u = cp^  (cpj  ^u).  So  cp^u  and 
qp1  u € Hr"1,r+l(Tk+1),  and  (2.38)  implies  (qp^e  Hr“2’ r+l(T^+lj 

(if  r < -1).  By  Lemma  2.19  (i),  cpjU  £ Hp“2,rf2(Tkfl).  Continuing 
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_2T— lr+1 

by  induction,  we  obtain  cpu  G H ’ (T  ),  and  by  Lemma  2.19  ( iii) , 
fpu  G C( [ 0, 1] , H^r"I(Tk)). 

I Hence,  given  e > 0,  there  is  a ct  < s (where  f G L'  ( (0, l) ,HS(Tk) ) ) 

k a k 

such  that  u restricted  to  3R  X (e,  1-e)  is  in  C((e,l-e],  H (T  )). 

Let  a be  the  cr  corresponding  to  € = l/b.  Let  vQ  = u(^)  £ H0^!^). 

Since  s-hyperbolicity  clearly  does  not  depend  on  the  sign  of  the  time 
variable  t,  we  can  apply  Theorem  2.17  going  both  forward  and  backward 
in  time  from  t = l/2  to  obtain  a solution  v G C([0, 1],  Ha(Tk) ) of 
system  (l.l)  satisfying  v(l/2)  = vQ.  We  claim  that  v and  u are 
the  same  distribution  in  D#  ( . It  suffices  to  show  that  for  each 
t > 0,  v and  u agree  in  IR  x (e,l-e).  But  this  follows  directly 
from  the  uniqueness  part  of  Theorem  2.17,  applied  with  s equal  to  the 
smaller  of  <r  and  a.  We  conclude  that  u 6 C ( [ 0, 1 ] , Ha(Tk)), 
proving  Theorem  2.20. 

Theorem  2.20  shows  that  if  system  (l.l)  is  s-hyperbolic,  then 
every  distribution  solution  of  (l.l)  in  q' Is  *-n  C JO,  l],Hs(Tk) ) 
for  some  s G ]R  . We  can  thus  restrict  our  attention  to  solutions  u 
which  are  in  C([0,1],  HS(Tk))  for  some  s G ffi.  To  determine  u, 
it  suffices  to  determine  u(0)  as  an  element  of  H (T  ). 


We  remark  that  if  u G C( [ 0, 1] , HS(Tk))  and  f G C( [0, l],HS"I(Tk) ), 
then  £ C( [ 0, 1 ] , Hs  ^(T  )),  so  it  makes  sense  to  view  u^(0)  as  a 
distribution  in  D'(Tk). 


CHAPTER  III 


AN  ILLUSTRATIVE  EQUATION 

In  this  chapter,  we  consider  question  (1.3)  for  the  sample 

problem 


(3.1) 


u.  = Au 
t x 


where  u = u(x,t)  = (u^u^)'  is  periodic  in  x,  x£]R,  0 < t < tQ, 


is  a constant  real-valued  matrix,  and  u^  is  the  more  completely  observable 
component.  This  system  is  symmetric  hyperbolic,  and  it  is  strictly 
hyperbolic  unless  b = 0 and  a = c.  The  ?.inearized  shallow-water 
equations  for  one-dimensional  flow  can  be  written  in  this  form,  Our 
purpose  in  considering  this  example  is  to  gain  some  understanding  of 
what  answers  to  question  (1.3)  can  reasonably  be  expected. 

To  be  able  to  infer  any  information  about  u^  from  u^,  there 
must  be  sufficient  linkage  between  the  equations  of  the  system.  For 
(3.1),  we  must  have  b ^ Oj  otherwise  the  system  uncouples.  Also,  some 
information  about  u2  is  necessary.  This  is  clear  since  u^  = 0, 
u£  = constant  is  a solution  of  (3.1).  A measurement  of  some  linear 
functional  of  ug  will  be  needed. 
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A natural  approach  to  this  linear  system  with  constant 
coefficients  would  be  to  cross-differentiate  and  eliminate  u0,  reducing 
the  system  to  a second  order  equation  for  u^.  If  a = c = 0 apd  b = 1, 
the  system  reduces  to  the  wave  equation  u. , = u for  u ',  and 

bU  XX  J. 

appropriate  data  for  determining  u^  would  be  giving  uL  and  d^u-j 
as  functions  of  x at  time  t = 0.  For  equations  with  variable 
coefficients,  Courant  and  Hilbert  (1962)  point  out  that  this  reduction 
is  not  always  possible.  Also,  solving  for  u^  does  not  solve  our 
problem;  u0  may  be  the  component  of  interest.  We  are  essentially  back 
where  we  started:  given  as  much  information  as  necessary  about  u^, 
find  u . However,  this  approach  does  suggest  that  we  consider  measuring 
u^  and  c^u^  at  t = 0 as  part  of  the  data  necessary  to  determine  u. 

If  time  derivatives  of  u^  are  available,  we  can  use  the 
first  equation  of  system  (3.1)  to  determine  u0  at  t = 0 from 
u^  and  d^u^  at  t = 0 and  Up(0,0).  If  b / 0,  we  can  solve  this 
equation  for  dxiip  yielding 

(3.2)  ax»2  ■ 5 <Vl  • “W 

At  each  time  level,  this  is  an  ordinary  differential  equation  for  Up. 

This  leads  to  a well-posed  formulation  of  the  problem. 

3.1.  Theorem.  Suppose  b ^ 0.  Given  v^(x)  G (^(T),  w(x)  G C(T) 
satisfying  w(0)  = 0,  and  yQ  G (E,  there  exists  a unique  Vp(x)  G Cp'(T) 
aich  that  the  solution  of  (3.1)  with  Initial  conditions 
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(3.3) 


U;L(x,0)  = vL(x)  , u2(x,0)  = v2(x) 


satisfies 


(3.^) 


uL(x,0)  = v L (x) 
a^CxjO)  = w(x) 
u2(0,0)  = yQ 


v2(x)  depends  continuously  on  the  data  v^x),  w(x),  and  y i 


(3.5) 


|v_||  . < |y 

2 C1(T)  C 


2a 

b 


,|vincl(T)  + M I|W|IC(T) 


where 


IML/t>  = sup  |<P(x)| 

0<x<l 


l|tpllcl(T)  ' McfT)  + Mc(I) 


Proof.  (3.2)  implies 

i x 

(3.6)  u2(x,0)  = u2(0,0)  + £ / atu1(c,0)dcr  - ^ (u1(x,0)  - ^(0,0)) 


The  theorem,  including  the  estimate  (3.5),  follows  immediately.  The 
1 

condition  0(0)  = / w.(x)dx  = 0 ensures  the  periodicity  of  v0(x). 

0 L 2 
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3.2.  Corollary.  Let  C(T)  denote  the  subspace  of  q>  £ C(T)  such 
that  $(0)  = 0.  Define 


L:C1(T)  X C1(T)  -+C1(T)  X C(T)  x IE 
by 

L(v1,v2)  = (u.^0),  dtu1(0),  u2(0,0)) 


where  u = (u^u^)'  is  the  solution  of  (3.1)  with  initial  conditions 
(3.3).  If  b / 0,  then  L is  a Banach  space  isomorphism. 

Proof.  Given  (v^v,.)  £ CL(T)  x CX(T),  clearly  at^Vj  + bdxv2  € C(T), 
so  the  first  equation  of  the  system,  viewed  as  an  equality  of  distri- 
butions in  D'(T)  at  t = 0 which  we  can  do  in  light  of  Theorem  2.20 
and  the  remarks  following  it,  implies  that  d^u^(0)  6 C(T)  and 


£ lal,||ui(0)llci(T) 


,(0)ll 


X(T) 


So  L is  a continuous  linear  operator.  Theorem  3.1  implies  that  L 
is  invertible  and  that  L ^ is  bounded. 

This  corollary  shows  that  there  is  a continuous  one-to-one 
correspondence  between  the  standard  initial  data  u^(0)  and  u2(0) 
and  the  nonstandard  data  u^(0),  d^u^O),  and  u2(0,0).  For  more 
complicated  equations,  the  compatibility  conditions  on  the  nonstandard 
data  for  the  existence  of  a solution  will  be  more  complicated,  and  we 
will  concentrate  on  obtaining  estimates  like  (3.5)  from  which  we  can 
obtain  uniqueness  and  continuous  dependence  results. 


The  data  requirements  on  u2(0)  in  this  approach  are  a3  minimal 
as  we  could  hope.  Thinking  in  terms  of  weather  model,  a measurement  of 
u2(0,0)  could  be  obtained  from  a single  weather  station.  We  can  also 
use  the  same  approach  using  u2(0,0),  the  mean  of  u2(x,0),  as  the 
data  required  of  u2(0)  (where  u(£,t)  is  defined  by  (2.35)). 

3.3.  Theorem.  Suppose  b / 0 and  s £ JR.  Given  v1  £ HS(T), 
w £ H (T)  satisfying  0(0)  = 0,  and  € X,  there  exists  a unique 
v2  £ HS(T)  such  that  the  solution  of  (3.1)  with  initial  conditions 
(3.3)  satisfies 

u1(0)  = v1 

(3.7)  d^u^(O)  = w 

u2(°,°)  = 9Q 


v2  depends  continuously  on  the  data  v^,  w,  and 


(3.8) 


IIv2IIs<k(|?0|  + 


'A  + TbT  M_1> 


where  K is  a constant  independent  of  a,  b,  and  c. 


Proof.  (3.2)  implies 

(3.9)  (27ri{)  u2(|,0)  = ^^(1,0)  - f (2tt1|)  Uj_(e,0) 

so  for  I / 0, 


4l 


The  estimate  (5.8)  and  the  theorem  follow  immediately.  The  condition 
0(0)  =0  is  necessary  to  make  (5.9)  hold  for  | = 0. 

3.^.  Corollary.  Let  HS  ^(T)  denote  the  subspace  of  cp  € HS  ^(T) 
such  that  $(0)  = 0.  Define 

L:HS(T)  X HS(T)  ->HS(T)  x HS_1(T)  X <E 

by 

L(vl,  v2)  = (ux(0),  3^(0),  u2(0,0)) 


where  u = (u^,u  )'  is  the  solution  of  (3.1)  with  initial  conditions 
(3.3).  If  b / 0,  then  L is  a Banach  space  isomorphism. 

Corollary  3.k  follows  from  Theorem  3.3  in  the  same  way  that 
Corollary  3.2  follows  from  Theorem  3.1.  Note  that  the  linkage  factor 
| b | , a property  of  the  system,  appears  explicitly  in  the  estimates 
(3.5)  and  (3.8),  indicating  that  the  accuracy  to  which  we  can  compute 
u2(0)  from  the  measured  data  depends  quantitatively  on  the  linkage  in 
the  system:  the  weaker  the  linkage  is,  the  less  accurately  we  can 
expect  to  be  able  to  compute  u2(0). 


A second  approach  to  the  problem  is  to  consider  using  u^(0) 
and  u-^t^)  for  some  t^  € (0,tQ]  to  determine  u2(0)  up  to  a 
constant.  Tills  data  is  analogous  to  the  data  given  in  a two-point 

B 


1 


I 


boundary  value  problem  for  a second  order  ordinary  differential  equation 
which  has  been  written  as  a system  in  the  usual  way. 


L 

3.5.  Lemma.  Suppose  b ^ 0 and  Vg  € HS(T)  for  some  s € 3R  . 

The  first  component  of  the  solution  of  (3.1)  with  initial  conditions 


(3.10)  u1(0)  = 0 , u2(0)  = v2 

is  given  by 


(3.11)  u1(|,t)  = t eTri£t(a+c)  sin(7r|td)  $2(g)  . 
where 

(3.12)  d = ((a-c)2  + kb2)1//2 

This  defines  u^  as  an  element  of  C([0,tQ],  HS(T)). 


Proof.  By  Theorem  2.17,  the  solution  u of  (3.1)  with  initial  conditions 
(3.10)  is  in  C([0,tg],Hs),  so  by  the  equation  (3.1)  itself, 
ut  € C ( [0, tg] ,HS  Hence  u G C^([0,t  ],HS  ^),  so  for  each  g G 1, 
u(g,t)  is  a C1  function  of  t G [0,t  ],  and 

^ (u(g,t) ) = ut(g,t)  *=Aux(g,t)  = (2Trig)  Au(g,t) 

So  u(g,t)  is  the  solution  of  the  ordinary  differential  equation 


(5.13)  ^ = (27rj£)Ay 

with  initial  conditions  y(0)  = (0,  $2(|)).»  Hence 


u(|,t)  = e27Ti|tA(0,  v2(|))' 


The  eigenvalues  of  A are 


(3.14) 


Ai  = ^ (a  + c + d), 


^ = - (a  + c - d) 


where  d is  given  by  (3.12).  The  matrix  with  the  eigenvectors  of  A 
as  columns  and  its  inverse  are 


(3.15)  P = 


We  have 


f 

-b  " 

-1  1 
and  P = - r^ 

■ h-c  b 1 

I 

1 

o 

Vc. 

bd 
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1 

rf 

O 

(3.16) 


P^AP  = 


al  0 


\ J 


If  e| 


(l,0) ' , then 


-1. 


uL(?,t)  = e|Pe27ri?tP  AP  P"1(0,v  (|)) 


*2(i) 

bd 


[-b  -b] 


r £ 

e 

0 


27ri|  t^L 


[ _b  ] 


b 27Ti5tA  27riSt?u 
=d(e  -e  ) ^o(S) 
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and  the  lemma  follows  since  Ax  + Ag  = a + c and  A^  - Ag  = d. 

3.6.  Theorem.  Suppose  b ^ 0 

(i)  If  t^d  is  a rational  number,  then  Ug(0)  cannot  be  determined 
uniquely  from  u^(0),  u1(t1),  and  u2(0,0):  there  are  C°° 
solutions  of  (3.1)  with  u2(0)  ^ 0 and 

(3.17)  ux(0)  = u1(t1)  =0  and  Ug(0,0)  = 0 . 

(ii)  If  t^d  is  an  irrational  number,  then  u2(0)  is  uniquely 

determined  by  u^O),  u1(t1),  and  u (0,0).  However,  for  each 
s € IR , we  do  not  have  continuous  dependence  in  HSi  there  is 
no  constant  K such  that 

(3.18)  l|u2 (0) lls  < K(  |u2(0,0)  | + ||u1(t1)||g) 

for  all  solutions  uG  C([0,tQ],HS)  with  ux(0)  = 0. 

Proof.  If  t^d  is  rational,  then  there  is  an  integer  n ^ 0 such  that 
T^t^d  € Z.  By  Lemma  3.5,  the  solution  of  (3.1)  with  initial  conditions 
ux(0)  = 0 and  u2(0)  = e277^Xrl  satisfies  (3.17)  since  sin^t^d)  = 0. 
This  proves  (i). 

If  t^d  is  irrational,  then  sinGrtt^d)  ^ 0 for  all  integers 
| / 0.  For  the  uniqueness,  we  may  assume  without  loss  of  generality 
that  ux(0)  = 0.  If  not,  let  w and  v be  the  solutions  of  (3.1) 
with  initial  conditions 
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(3.19) 


w]_(0)  = Uj_(0)  , w?(0)  = 0 

(3.20)  Vl(0)  = 0 v,  (0)  = u2(0) 

By  the  linearity  of  (3. 1) , u = w + v,  so  v = u - w.  Theorem  2.17 
implies  that  w^(t^)  is  uniquely  determined  by  u^(0),  and  thus  v^(0) 
is  uniquely  determined  by  u^(0)  and  u^(t  ).  Hence  u (0)  is 
uniquely  determined  by  u^(0),  u (t^)  and  uo(0,0)  if  and  only  if 
vo(0)  is  uniquely  determined  by  v^(t^)  and  v (0,0).  Now  that  we 
have  reduced  the  problem  to  the  case  u^(0)  = 0,  the  uniqueness  follows 
directly  from  Lemma  3.5  since  uo(§,0)  is  determined  by  u^(|,t^) 
for  % t 0 by  (3.11) . 

Since  t^d  is  irrational,  given  e > 0,  there  is  an  integer 
Tj  / 0 such  that  the  distance  from  Tjt^d  to  the  nearest  integer 
is  less  than  e/ir.  Hence  | sin(7rr|t^d)  | < e.  Define  u to  be  the 
solution  of  (3.1)  with  initial  conditions 

u1(0)  = 0 and  u2(0)  = (l  + hi2)"8/2  e"vlxr[ 

Then  !!u2(0)||g  = 1,  u2(0,0)  = 0,  and 

HWllg  = (l  + h|2)S//2  |u1(r),t1)| 

= ~ sin^qt^d) 

Since  e was  arbitrary,  there  is  no  constant  K such  that  (3.18) 
holds  for  all  solutions  u€  C([0,tQ],Hs)  with  u^(0)  = 0. 
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We  call  the  data  (u.^0),  u1(t1)>  u2(0,0))  £ Hs  x Hs  x I 

admissible  if  there  is  a solution  u £ C([0,tQ],HS)  which  yields 

this  data.  Consider  the  mapping  M from  the  space  of  admissible 

data  in  HS  x HS  x I to  HS  which  maps  (u^(0),  u^t-j),  u2(0,0)) 

to  the  uniquely  determined  u£(0).  Let  V be  the  subspace  of 

HS  x H5  x I consisting  of  all  admissible  data  in  (0}  x Hs  x d. 

The  restriction  of  M to  V is  a linear  one-to-one  map  of  V onto 

H by  Theorem  2.1?  and  the  uniqueness  result  above.  Since  (3.l8) 

does  not  hold  for  any  K,  M is  not  continuous  on  V,  so  we  do  not  have 

continuous  dependence.  We  remark  that  V cannot  be  a closed  subspace 

since  if  it  were,  the  open  mapping  theorem  applied  to  the  continuous 
• 2.  s 

linear  operator  M :H  -+  V would  imply  that  M is  continuous. 

This  completes  the  proof  of  Theorem  3.6. 

The  uniqueness  result  in  Theorem  3.6  and  its  dependence  on  the 

irrationality  of  a certain  parameter  is  similar  to  the  result  of 

Bourgin  and  Duffin  (1939)  discussed  in  the  introduction. 

We  can  also  consider  measuring  u., (t)  for  t = 0,  t..  , ...  , t 

J-  l m 

where  0 < t < •••  < t < trt.  As  before,  the  best  we  can  hope  to  do 
is  to  determine  u2(0)  up  to  a constant.  If  t^,  ...  , t are 
rational  multiples  of  each  other,  we  may  assume  without  loss  of  generality 
in  the  following  theorem  (by  adding  elements  to  the  sequence 
• • • > ^m  ^ necessary,  increasing  m)  that  t..  = jt^. 
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3.7.  Theorem.  Suppose  b / 0,  and  for  j = 1, . ..,m. 

(i)  If  t^d  is  a rational  number,  then  u2(0)  cannot  be  determined 
uniquely  from  u^O),  u^t^),  ...  , u^(t  ),  and  ^(0,0). 

(ii)  If  t^d  is  an  irrational  number,  then  u2(0)  is  uniquely 
determined  by  u^CO),  u^t^),  ...  , u^(t  ),  and  u2(0,0), 
but  for  each  s € 1R  , vre  do  not  have  continuous  dependence 
in  HS. 

Proof.  The  proof  is  essentially  the  same  as  the  proof  of  Theorem  3.6. 

If  t^d  is  rational  and  qt^d  € I,  then  qt^d  € Z for  j = 1,  ...,m, 
and  (i)  follows  as  in  Theorem  3.6. 

If  t^d  is  irrational,  the  uniqueness  is  a result  of  Theorem  3.6. 
Given  e > 0,  there  is  an  integer  q / 0 such  that  the  distance  from 
qt^d  to  the  nearest  integer  is  less  than  e/mir.  Then  for  j = l,...,m, 
the  distance  from  qt^d  to  the  nearest  integer  is  less  than  e/ir,  so 
| sin(Trqt^d)  | < e,  and  (ii)  follows  as  in  Theorem  3.6. 

Since  the  distinction  between  rational  and  irrational  numbers  is 
not  practical  when  measurements  are  involved,  if  we  measure  u^  at  a 
fixed  finite  number  of  time  levels  and  u2(0,0),  we  cannot  expect 
theoretical  uniqueness  or  theoretical  continuous  dependence.  Practically, 
however,  the  extra  information  we  have  by  measuring  u^(t)  at  several 
time  levels  may  enable  us  to  determine  u2(|,0)  for  more  small  values 
of  | ^ 0 more  accurately.  We  will  discuss  this  further  in  Chapter  VI. 

Even  though  we  are  restricted  to  using  u^Ct)  at  a finite 
number  of  time  levels  for  computations,  it  does  not  necessarily  follow 
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that  the  appropriate  theoretical  question  involves  a finite  number  of 
time  levels.  For  example,  difference  methods  for  the  numerical  solution 
of  differential  equations  use  discrete  equations  to  model  continuous 


equations.  If  the  time  levels  at  which  we  measure  u^  are  sufficiently 
dense  in  time  in  view  of  the  step  sizes  of  the  discrete  variables  and 
the  accuracy  desired  in  a computation,  we  may  consider  our  measured  data 
as  an  approximation  to  ux  in  C([0,tQ],Hs) . The  following  theorem 
suggests  that  this  may  be  a fruitful  point  of  view  to  take. 


3.8.  Theorem.  Suppose  b ^ 0 and  s and  tQ  are  fixed.  Then  there 
is  a constant  K such  that  for  every  solution  u G C([0, tQ],Hs)  of 
(3.1), 

(3.21)  l|u2(0)|(s  < K(|u2(0,0)|  + ||u1(0)  ||s  + ||U]J|s>2) 

Proof.  Let  u G C([0,tg],  Hs)  be  a solution  of  (3.1).  Let  w and 
v be  the  solutions  of  (3.1)  with  initial  conditions  (3.19)  and  (3.20), 
respectively.  Then  u = w + v.  By  Lemma  3.5, 


IvjU.t)!  = 


^ 3in(7T|td)  u2(t,0) 


so 


t t 

lvlllg  2 = f llvL(t)||g  dt  = / E (1  + III2)8  |v  (|,t)|2  dt 

S’2  0 1 3 0 |GZ  1 


2 0 
■^2  ^ (!  + ll|2)S  |u?(|,0)| 2 I sin2(7T5td)dt 

a eez  * 0 


Since  for  5^0, 


/ sin2(7r?td)dt  = alH  - sln(a|p|) 

0 % d 1 5 | 

"her*  a . 2^1,  clearly  there  1.  a constant  > 0 Independent  of 
5 ^ 0 such  that 


0 2 
/ sin  (ttS td)dt  > K 

0 “ ‘ 


> K0  for  I/O 


Hence 


Now 


|S,(0,0)|2  + -4—  II,  !|2 


tb2K„ 


'A, 2 ^ l|wJs,2  * MIs>2 

< ^ K#.,.  * K«s,2 

SKlK«»»e  + «»!«.,  2 


for  some  constant  Kj  by  Theorem  2. 17.  The  theorem 


follows. 


3.9.  Corollary.  Suppose  b^O  and 
there  is  a constant  K such  that  for 
of  (3.1), 


s and  tQ  are  fixed.  Then 
every  solution  u G C([0,t0J,Hs) 


2)  llu2(°)lls<K(|u  (0,0)|  + (|u  ||  ) 

1 S ,00 

Corollary  3.9  follows  Immediately  from  Theorem  3.8.  These 
show  that  „2(0)  1,  uniquely  determined  by  and  depends  continuously  on 

the  data  ^(t)  for  0 < t < t0  and  1,(0, 0). 


CHAPTER  IV 


APPROACHES  USING  TIME  DERIVATIVES  OF  uJ 


In  this  chapter,  we  investigate  ways  in  which  u 1(0)  can  be 
recovered  from  data  involving  time  derivatives  of  u1  for  the  general 
system 


(1.1) 


xv 

u.  = Z A (x,t)u  + B(x,t)u  + f(x,t) 
j-1  J Xj 


which  we  assume  to  be  s-hyperbolic  (see  Definition  2.16).  In  view  of 
the  difficulties  in  obtaining  higher  time  derivatives  accurately  from 
measurements,  we  consider  data  with  at  most  one  time  derivative  of 
u1.  We  present  two  main  approaches.  The  first  approach  (Theorem  4.1) 
is  the  analogue  of  the  method  used  in  Theorems  3.1  and  3.3.  We  use 
the  first  i equations  of  system  (l.l)  at  time  t = 0 as  a system 
to  be  solved  for  u^I(0),  using  the  data  u*(0)  and  u^(0).  This 
approach  is  similar  to  methods  for  static  initialization  as  in  Ghil 
(1975).  The  second  approach  (Theorem  4.2)  can  be  used  when  there  are 
inhomogeneous  terms  which  are  not  necessarily  continuous  in  time, 
in  which  case  u^(0)  may  not  be  well-defined.  This  approach  yields 
estimates  involving  the  ^((0,^),  Hs(Tk))  norms  of  u1  and  u^. 

In  both  approachs,  sufficient  linkage  in  the  system  between  u1 
and  u11  is  necessary.  The  linkage  may  occur  in  the  first  order 
terms,  the  zero  order  terms,  or  in  a combination  of  the  two.  We 
discuss  sufficient  conditions  in  the  section  on  the  inversion  of 
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first  order  operators,  and  then  put  these  conditions  together  with 
the  two  general  methods. 


We  split  each  of  the  matrices  A.(x,t)  into  blocks; 

d 


(4.1) 


a(21)  a(22) 


where  is  i x l,  A^1'  ^ is  Jt  x (n-i),  a{‘  1 ^ is  (n-£)  x g, 

(22) 

and  Aj  is  ( n-£ ) x (n-£).  Split  B(x,t)  into  blocks  similarly. 
Viewing  t as  a parameter,  define  the  operators 


(4.2)  L^v^(t)  = Z A^v^(x,t)  d +B^w^(x,t)  for  l<p>v<2 


Then  ( 1. 1 ) becomes 


> ' ' * j 


(4.3) 


(fc.M 


I T (11)  I A _ (12)  II  A T 
u^=L  1 u + L 'u  + f 


II  T (21)  I A T (22)  II  A -II 
u = L u + L 'u  + f 


General  methods 

4.1.  Theorem.  Suppose  f £ C([0,tQ],Hs),  and  suppose 
u £ C ( [ 0, tQ ] , Hs ) is  a solution  of  (l.l)  for  which  u^O)  € Hs+1 
and  uj(0)  £ HS.  Then 

(4.5)  I!l<12>(0)  un(0)|l8  < K||uI(0)|ls+1  * ||u'(0)|ls  + ||fI(0)||| 


r<‘  < 


Proof.  Aa  we  remarked  after  the  proof  of  Theorem  2.20,  since  u and 


f are  both  in  C([0,tQ],Hs),  ut(0)  is  a well-defined  distribution 
in  d'(T  ).  The  estimate  (4.5)  follows  immediately  from  (4.3). 


4.2.  Theorem.  Suppose  that  f £ L2((0,1),  Ha+1)  and  that  the  system 


(4.6) 


= L 


(22)y 


is  s-hyperbolic.  Assume  that  for  some  constant  independent  of 

t £ [ 0 , tQ  ] , 


(4.7)  ||v||  < K ( ||L(12) (t)v||  + ||v||  ) 

s u a s-i 


If  u £ C([0,1],HS)  is  a solution  of  (l.l)  for  which  u1  £ L2( (0, t0),Ho+2) 
and  u*  £ L2((0,t0),HCT),  then 


(4.8)  ||un(0)||s  < ^(Hu" 


c+2 , 2 


Ut  c,2  + 


^c+1,2  + 


HuII(0)lls_1) 


and  hence  there  is  a constant  M such  that 


(4.9)  ||uI;C(0)|is  < K2( 

for  some  constants 


£ un(|,0)  + Hu1! 


|i|<M 


c+2, 2 + Ut  a,2 + ^ ’ct+1,2 


and  K_  where  u(|,t)  is  defined  by  (2.35). 
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r.  • / *0"*- 


_ — r&iii,  - 


Proof.  Let  v denote  the  solution  of  (4.6)  with  initial  conditions 

(4.10)  v(0)  = uII(0)  . 

By  the  well-posedness  of  the  s-hyperbolic  system  (4.6)  with  initial 
data  at  time  t and  the  solution  moving  backward  in  time, 

(4.11)  ||v(0)|is  < K'||v(t)||s  for  0 < t < tQ 

for  some  constant  K',  which  can  be  chosen  independent  of  t.  Let 
w denote  the  solution  of 

(22) 

(4.12)  wt  = L w + B 


with  initial  conditions  w(0)  = 0 where 


. ♦ f11 


By  Theorem  2.17, 


(4.13)  IMI0+1>2  < llwlla+l,«  ^ K"(lluI|lo+2,2  + "a+l,2) 


Clearly  u1*  = v + w.  Substituting  this  for  u in  (1.3),  we  obtain 


u1  . t b(12)w  ♦ X,(l2)v  ♦ f1 

t 
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T7" 


SOM**' 


■ 


Hence 


IIl(12)vIL.,  < ll>‘Ill0+1,2  * + fr<12)»||„ , + 


< 0O»8t(||uI||0+2_2  * ||uj||0)2  + l|f|l0+,|2) 


by  (4.13).  By  (4.11),  (4.7)  and  (4. 14) 


t0llv(o)|r-/  ||v(0)|f  dt 
u s o 3 


< const ( / ||v(t)||  dt) 

0 8 


< const  / (||l/  '(t)  v(t)  ||  + ||v  (t ) ||  Jdt 

0 0 


< const(||L(l2)v||^2  + llvll^J 

< constdiu1!!^^  + ti«jn0f2  + iifii0+1>2 + 


and  (4.8)  follows  from  (4.10).  Since  clearly  there  is  a constant  M 
such  that 


(4.15) 


K1l|uII(0)||  , < \ lkII(0)||  + const  E |uZI(|,0)|; 

1 1 2 S ||  | <M 


we  can  pull  the  \ ||uZZ(0)||  1 term  in  (4.8)  over  to  the  left  hand  side, 

and  (4.9)  follows,  proving  Theorem  4.2. 

We  remark  that  (4.6)  is  automatically  s-hyperbolic  for  symmetric 
hyperbolic  systems  (l. 1). 


jTaMfi 


Inversion  of  first  order  operators 


Both  of  the  methods  presented  in  Theorems  4.1  and  4.2  rely 

( 12  ■) 

on  the  partial  inversion  of  the  first  order  operator  L ' defined 
in  (4.2).  In  this  section,  we  consider  first  order  operators  of  the 
form 


k 


(4.16) 

Lv  = Z a.(x)v  + 3(x)v 

j=l  3 XJ 

where 

“jW 

and  g(x)  are  real-valued  £ x m matrix  functions  in 

D(Tk) 

with 

£ > m.  The  principal  symbol  of  L is 

k 

(4.17) 

£ (x,|)  = 2rri  E a.(x)| 
j=l  0 3 

We  discuss  four  possible  ways  of  inverting  L.  The  first  (Theorem 
4.4)  is  analogous  to  Theorem  3.1.  If  x^  is  a time-like  variable, 
we  can  recover  v from  Lv  and  v restricted  to  the  k-l-dimensional 
hyperplane  x^  = 0.  The  second  (Theorem  4.5)  uses  the  observation 
made  by  Lax  (1955)  that  if  the  first  order  terms  of  an  m x m periodic 
first  order  operator  are  symmetric  and  the  zero  order  terms  are 
sufficiently  positive,  then  the  operator  can  be  Inverted  on  Hs. 

In  the  third  (Theorem  4.7),  we  assume  that  £^(x,t)  is  of  full  rank 
for  | / 0,  and  conclude  that  we  can  recover  v from  Lv  and  a 
finite  number  of  Fourier  coefficients  of  v.  The  fourth  (Theorem  4.8) 
uses  a result  of  Friedrichs  and  Lax  (1965)  for  square  systems  combining 
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the  second  and  third  approaches  which  extends  readily  to  the  non- 
square case.  In  each  of  these,  we  desire  to  bound  the  norm  of  v 
by  some  norm  of  Lv  and  an  appropriate  norm  of  the  necessary 
observations  of  v. 

We  will  need  the  following  Lemma  in  the  proof  of  Theorem  4.4. 

4.3.  Lemma.  Let  s be  a non-negative  integer.  Assume  that  system 

(1.1)  is  s-hyperbolic,  that  the  coefficient  matrices  A.  and  B 

J 

are  periodic  in  x and  t,  and  that  f £ HS(Tk+1).  If  u £ C( [0, l],Hs(Tk) ) 
is  a solution  of  (l.l)  which  is  periodic  in  t,  then  u £ Hs(Tk+1)  and 


(4.18) 


Hun  . < K( ||u(0) ||  + llfll  ) . 


HS(Tk+1) 


HS(Tk+1) 


Proof.  We  use  the  spaces  Hs,m(Tk+1)  and  norms  llvll,  , introduced 

(,s,m; 

in  Definition  2.18.  For  a = 0,1,..., s, 


(1  + |||y)S"a  (1  + M*)0  < (1  + |5|*  + | r|*) 


2>o 


i2sS 


/ \ < f ,,,,  < M where 

(s-a,a)  - 11  "Hs^Tk+lj  - 


so  f £ Hs-°»0(Tk+1)  ^ ||f| 

M = ||u(0)  ||  + ||f||  . 

s HS(Tk+1) 

By  Lemma  2.19  (if)  and  methods  similar  to  those  used  in 
Lemma  2.19  (ill),  L2((0,1),  Hs(Tk) ) and  Hs,0(Tk+1)  are  the  same 
space  (as  subspaces  of  D'(Tk+1))  and  ||v||/  = ||v||  are  the 

same  norm.  Since  u£  C([0,1],  Hs(Tk) ) , u £ HS,0(Tk+1),  and 
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(s,0) 


U s,2  * Ml,,. 


< const (||u(0) || g + llf|ls>2) 

< const(||u(0)||s  + Hfll(S/0)) 


< const  M 


The  second  inequality  is  a consequence  of  Theorem  2.17.  The  equation 
(1.1)  now  implies  that  u.  S Hs‘1,0(Tk+1),  and 

w 


Ut  (s-1,0)  ^ c°nst||u||(g^o)  + Hf|l(a>0)  < 


< const  M 


So  by  Lemma  2.19(i),  u S H8-1’ L(Tk+1)  and 


< const  M 


The  equation  (l.  1)  implies  that  u.  £ HS"2’ 1(Tk+1) , and 


K11  (8-2,1)  * COnst||u|l(s-l,l)  + l|fll(s-l,l)  ^ Const 
So  again  by  Lemma  2.19(i),  u £ HS"2,2(Tk+1)  and 


lUll  (s-2,2)  - COnst  M 


Continuing  in  the  same  manner,  we  obtain 


(4.19) 


Ml(fl_a  < const  M for  a - 0,1,...,  s 


■*  .y  . *• 


Now,  s ince 


h 


/ 


V 


r > 


(i  * Is  I2  * M2)8  = Z (“>  a * lei2)8-”  (M2)" 

<7=0  CT 

< const  Z (1  + U|2)8_a  (1  + |t|2)a 
a=0 

we  conclude  that  u £ HS(T^+1)  and 

(4.20)  ||u||  k , < const  Z ||u||/  \ < const  M 

hs(tK+1)  - a=0  (s -a,  a)  - 

and  the  lemma  follows. 

4.4.  Theorem.  Consider  the  operator  L defined  by  (4.l6). 

Suppose  that  there  is  an  m x t matrix  function  r(x)  € d(T  ) such  that 
for  some  i. 


(4.21)  r(x)  a t(x)  = I . 

Without  loss  of  generality  we  assume  i = k.  Let  s be  a non-negative 
integer.  If  k = 1,  v 6 d'(T),  and  Lv  £ HS(T),  then  v £ HS+1(T)  and 

(4.22)  l|v|lB+1  < K(|v(0)|+  ||Lv||8) 

If  k > 1,  we  suppose  further  that  the  equation 

k-1 

(4.23)  w + Z r(x)  a.(x)  w + r(x)  p(x)w  = 0 

k J- 1 0 xj 
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I 

< 

' 


is  s-hyperbolic  with  as  the  time  variable  and  x1  = (x^  . ...x^  )» 

as  the  space  variables.  If  v £ d' (Tk)  is  in  C([0,l],Hs(Tk_1)) 
and  Lv  £ Hs(Tk),  then  v € HS(Tk)  and 


(4.24) 


|v||  < K(||v(x\o)||  + ||Lv||  ). 

s HS(Tk  1)  3 


Proof.  Suppose  k = 1,  v £ d'(T),  and  Lv  € HS(T).  Let  g(x)  = r(x)Lv. 
Then  multiplying  (4. 16)  by  r(x)  yields 


(^•25)  vx  + r(x)  P(x)v  = g(x) 

Let  'i(x)  be  the  fundamental  solution  matrix  of  the  homogeneous 
adjoint  equation 

(^•26)  w x = (r(x)  P(x))*w. 

Then  ® € C [0,1],  and  by  definition  <t> (0)  = I.  We  have  dx($*) 

= <t  r(x)  P(x),  so  multiplying  (4.25)  by  ®*(x),  we  obtain 

^x('®*v)  = $*(x)  g(x)  . 

Since  HS(T)  is  the  set  of  functions  in  CS"1(T)  whose  s-th 
derivative  is  in  L2(T)  and  g € HS(T),  <t>*v  € C8[0, 1]  and 

"v)  £ LP(0,1).  Since  4>(x)*  is  invertible  and  C°°,  v € CS[0,1], 

2 


s 


(4.27) 


— 1 — 1 ^ 

v(x)  = <&*(x)  (v(0)  + ®*(x)  / ®*(o)  g(a)dcr) 

0 

Since  v is  periodic  by  assumption,  we  conclude  that  v £ HS+1(T). 

The  estimate  (4.22)  follows  from  (4.27)  since  ||u||  is  equivalent  to 

s 

the  norm  ||3^v ||Q  • 

Suppose  k > 1,  v € d' (T^)  is  in  C([0,1],  Hs(Tk_1)),  and 
Lv  £ Hs(Tk).  Let  g(x)  = r(x)  Lv.  Then  multiplying  (4,l6)  by 
r(x)  yields 

k-1 

(4.28)  v + Z r(x)  a (x)v  + r(x)  a(x)v  = g(x) 

X j=l  J Xj 

By  Lemma  4.3,  v £ HS(Tk)  and 

||v||  < const(||v(x1,0)||  . . + ||g||  ) 

s HS(Tk-L) 

and  (4.24)  follows,  completing  the  proof  of  Theorem  4.4. 

The  condition  (4.21)  is  satisfied  in  the  case  k = 1 if  and 
only  if  a(x)  is  of  full  rank  for  all  x.  Recall  that  we  have  assumed 
i > m.  If  a(x)  is  of  full  rank,  then  a(x)*  a(x)  is  invertible, 
and  we  may  take  r(x)  = (a(x)*  a(x))-1  cr*(x). 

We  remark  that  the  assumption  in  Theorem  4.4  that  v £ C([0,l],Hs(Tk”^) ) 
can  be  reduced.  Since  v is  a solution  of  the  s-hyperbolic  equation 
(4.28)  and  g £ L2((0,1),  ^(T^"1))  by  the  assumption  that  Lv  £ Hs(Tk), 

n k-1 

Theorem  2.20  implies  that  v £ C([0,1],  H (T  ))  for  some  o,  so 

1 k-1 

v(x  ,0)  is  a well-defined  element  of  D' (T  ).  By  Theorem  2.17,  it 

suffices  to  assume  that  v(x\o)  € Hs(Tk”'L). 


i+.5.  Theorem.  Consider  the  operator  L defined  by  (*4.16). 

Suppose  that  there  is  a real-valued  m X i matrix  r(x)  € o(Tk)  such 
that  r(x)  cr(x)  is  symmetric  for  all  x and  for  j = 1, 

Define 

k 

(^•29)  q(x)  = 2Re(r(x)  P(x))  - Z 3 (r(x)  a (x) ) 

j=l  XJ  J 

Suppose  that  for  some  constant  c^  > 0,  either 

(^•30)  q(x)  > cQI  or  q(x)  < -cQI  for  all  x. 

If  v € H°(Tk)  and  Lv  £ H°(Tk),  then 
^•31)  ||v||0  < k0||Lv||0 


for  some  constant  K, 


Suppose  that  s £ ]R  is  such  that  the  norm 


RL  as  an  operator  from  H to  H satisfies 


2r  < c where  A is  defined  by  Definition  2.lk.  If  v € Hs  and 


for  some  constant  K 


Proof.  We  assume  without  loss  of  generality  that  q(x)  > cQI. 

We  use  the  convention  here  as  in  Definition  2.4  that  if  lower  case 
letters  are  symbols,  the  corresponding  operators  are  represented  by 
capital  letters.  By  integration  by  parts,  the  adjoint  of  the  operator 
RL  is  clearly 

k k 

(RL)*  = - E r(x)  a (x)  d - E b (r(x)  a,(x))  + (r(x)  p(x))* 
J=1  J XJ  J=1  XJ  i 

since  r(x)  CCj(x)  is  symmetric.  So  Q = RL  + (RL)*.  Suppo  se  v S H° 
and  Lv  € H°.  Since  r(x)  £ S°,  the  Schwarz  inequality  and  Theorem  2.11 
imply 

| (RLv,  v)  | < ||RLv||0*  l|v|(0  < const||Lv||0*  ||v||0  . 

So  c0llvllo  = co(v'v) 

< (Qv,v) 

= ((RL  + (RL)*)v,v) 

= (RLv, v)  + (v,  RLv) 

= 2Re(RLv,v) 

< 2 | (RLv, v) | 

< const||Lv||0- l|vM0 

and  (4.31)  follows. 

Since  AS($)  commutes  with  cRL(x,|),  Corollary  2.10  implies 
[A  ,HL]  - PS  , so  AS,RL]A  8 £ PS°  and  by  Theorem  2.11,  [aS,RL]a”S 
maps  H°  continuously  into  H°.  Suppose  that  2r  < c.,  v € HS,  and 

S 0 

Lv  € Hs.  Then 
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r<  |(asklv,  asv)|  < ||asrlv||0-  ||asv||0  = ||rlv||s- ||v||b 

< const||Lv||s-||v||s 

I 

i Also, 

|([As,RL]v,Asv)|  = |(|as,rl]a"Vv,  asv)|  <rs(Asv,Asv) 

- r.K 

SO 

c0  II  vli  g = cq(aSv,ASv) 

< (QASv,ASv) 

< 2Re(RLASv,  ASv) 

< 2|(ASRLv,ASv)  | + 2|  ([AS,RL]v,ASv)  | 

< 2rcllv||"J  + const||Lv||  * ||v||  . 

“So  So 


and  4.32  follows,  proving  Theorem  4.5. 

4.6.  Lemma.  Consider  the  operator  L defined  by  (4.16).  Suppose 
that  for  each  x and  | / 0,  the  matrix  £^(x,|)  defined  by  (4.17) 
has  full  rank.  If  v € Hs-1  and  Lv€Hs-1,  then  v € Hs  and 

(**•33)  llv|l8  < Ks ( ||lv|Is_1  + ||v|ls_1) 

for  some  constant  K . 

s 

Proof.  Since  £^(x,§)  has  full  rank  for  | ^ 0,  £*(x,|)  ^(x,!)  is 
positive  definite  and  homogeneous  of  degree  2 in  | for  |||  > 

Since  and  the  unit  sphere  in  ]Rk  are  compact,  there  is  a constant 

Cq  > 0 such  that 
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(4.34) 


A~2U)  ^(x,*)*  ^(x,|)  > cQI 


for  all  x and  for  |g|  >1.  Let  cp(|)  € c(IT)  satisfy 


(4.35) 


0 < <p  < 1 

<P(|)  s 1 
<P(l)  s 0 


for  | | | < 1 
for  1 1 | > 2 


Define 


(4-36)  p(x,|)  = a“2(|)  ^(x,!)*  £l(x,|)  + cQcp(|)I 


Then  the  matrix  p(x, £)  is  Hermitian  for  all  x and  f,  and 


(4.37) 


p(x,0  > cQI 


Since  (.Ax,^)  £ Sl,  p(x,|)  € S°.  Define  r(x,|)  = p(x,|)”1'. 


Clearly  r(x,|)  € SU.  Let 


«o(*»l)  = oRp(x,|)  - I = aRp(x,|)  - r(x,| ) p(x,|)' 

(x,| ) =a  p (x,|)  - p(x,|) 

L*A  L 


By  Corollary  2.10,  gQ  and  g1  are  in  S-  . 

Suppose  that  v £ H8"1  and  Lv  € H8"1.  Then  since  L*A~2€PS”L, 
L*A_2Lv  £ Hs,  so  Pv  = L*A_2Lv  - G^v  £ Hs.  Hence  RPv  € Hs,  so 
V = RPv  - Gqv  £ H8. 


• a* 

* f 


To  prove  (4.35),  we  first  assume  that  s = 0 
Inequality  (Theorem  2.13)  applied  to  p(x,|), 


cons 


cons 


> Re(Pv,v)  - const||G  v||  • ||v|| 


const  j|v|| 


> const  v 


const  vi 


Since  ||v||  < ||v||  and  ||Lv|| 


< const || v||  , we  have 


< const (||Lv|| 


< const  (||Lv|| 


so  for  some  constant  K, 


For  general  s,  since  we  now  know  that  v € H 


so  by  (4.39)  and  the  fact  that  [L,aS]  € PS 


* 


!|v||s  = I|asv||0  < kq(  I|lasv||_^  + ||aev||_1) 

< k0(||asLv||_1  + ||[l,as]v||_1  + ||asv||_1) 

< const (||Lv||s_1  + ||v||s_;L) 


which  is  (4.33),  proving  Lemma  4.6. 


- 


4.7.  Theorem.  Consider  the  operator  L defined  by  (4. 16).  Suppose 
that  the  matrix  has  full  rank  for  each  x and  each  | ^ 0. 

If  v £ D'(Tk)  and  Lv  € H8"1,  then  v£Hs  and 


(4.33) 


l|v||s  < ks(||lv|Is_1  + llv|ls_1) 


Hence  there  is  a constant  M such  that 

s 


i 

mm 

i 


(4.40) 


WI<K'(  £ |v(s)|  + IIltII  ,)  . 

|5|5<S  S'1 


Proof.  Since  v £ d'(T;,  v € H°  for  some  0 £ ]R  , and  without  loss 

S- i 

of  generality  we  may  assume  v £ H for  some  positive  integer  j. 

Since  Lv  € HS  \ Lemma  4.6  implies  that  v £ Hs”^+\  Applying  Lemma  4.6 
repeatedly,  we  obtain  v £ H and  now  Lemma  4.6  applies  as  stated, 
giving  (4.33).  (4.40)  follows  from  (4.33)  in  the  same  way  that  (4.9) 

follows  from  (4.8). 
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4.8.  Theorem.  Consider  the  operator  L defined  by  (4.16). 


Suppose 

that  there  is  an  m x 1 matrix  symbol  r(x,|)  6 S°  such  that  r(x,|) 
is  Hermitian  and  homogeneous  of  degree  0 in  f for  || | >1  and 

(4.41)  r(x,|)  + £1(x,|)*  r(x,|)  = 0 

Define  the  symbol  q(x,|)  € S°  by 

(4.42)  q = Be(2rP  + E (2dW^  - d¥(ri  ))) 

I r | =i  5X1  5 x 

Suppose  that  for  some  constant  cQ  > 0,  either 

(4.43)  q(x,|)  > cQI  or  q(x,|)  < - cQI  . 

in  an  open  set  C T11  x which  contains  every  (x, |)  with  |||  > 1 

for  which  ^(x,|)  is  not  of  full  rank;  since  q and  are  homo- 

geneous in  | for  |||  > 1,  we  may  assume  that  C { (x, • ): || | >1) 
and  that  (x,|)  £ if  and  only  if  (x,p|)  £ for  p > |||_1. 

If  v £ H°  and  Lv  £ H^,  then 

(4.44)  ||v||0  < K(||Lv||0  + ||v||_1) 
and  there  is  a constant  M such  that 

(4.45)  llv|L  < K'(  Z |v(|)|  + ||lv|L  ) 

0 lei  <M  0 
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Proof.  Let 


<p(0  satisfy  (*+.35),  and  define 

p(x,|)  = A~2ii ) i|(x,|)  i1(x,|)  + cp(|)I 

It  k 

As  in  the  proof  of  Lemma  4.6,  since  T and  the  unit  sphere  in  IR 
are  compact,  there  is  a constant  c > 0 such  that 

p(x,  0 > cl 

for  (x,|)  £ r2  = x - rr  Since  p(x,  |)  >0  elsewhere,  q(x,|) 
is  uniformly  bounded,  and  assuming  q(x,|)  > c^I  in  r^,  there 
is  a b > 0 such  that  for  all  x and  £ 

p(x,|)  + 6q(x, | ) > const  I > 0 . 

Noting  that  p + 6q  is  Hermitian  and  applying  Garding's  Inequality, 

(4.46)  Re((P  + 5Q)v,v)  > const||v||2  - constllvll2^ 

By  (4.4l)  and  Theorems  2.7  and  2.9,  it  is  easy  to  verify  that 

aRL+(RL)*^X’^  = + 6o^x’^ 

where  gQ  6 S-1.  Also 

o p (x>0  *=  P(x>l)  + 6n  (x>l) 

L*A  L L 

where  g^  £ s"’1.  Hence 
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Re((P  +6Q)v,v)  = Re((L*A_2L  + 6(RL + (HL)*) )v, v)  - Re( (&0Q  + G )v,v) 

< |(L*a'2Lv,v)|  + 26' | (RLv, v)  | + |((6G0  + G1)v,v)| 

< IlLvIl^  + const ||Lv|j0  ||v||0  + con8t||v||_1*||v||0 

< const (||Lv||0  ||v||0  + l|v||_1*  ||v||0) 

So  by  (4.46), 

I! v || Q < const (||Lv||0  ||v||0  + ||v||_1*||v||0  + IWI^) 

< const (||Lv||0  ||v||0  + llv||_1'||v|l0) 

and  (4.44)  follows.  (4.45)  follows  from  (4.44)  in  the  same  way  that 
(4.9)  follows  from  (4.8). 

Methods  Using  u^(0)  Restricted  to  a k-I-dlmensional  Hyperplane 
In  the  last  three  sections  of  this  chapter,  we  state  the 
conclusions  which  can  be  drawn  by  combining  Theorem  4.1  or  Theorem  4.2 
with  the  theory  of  first  order  operators  presented  in  the  previous 
section.  The  proofs  are  immediate  consequences  of  the  theorems 
presented  earlier  in  this  chapter.  We  assume  for  the  rest  of  this 
chapter  that  the  system 

k 

(4.1)  u « £ A (x,t)u  + B(x,t)u  + f 

1 J=1  J Xj 

is  s-hyperbolic. 
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it. 9.  Theorem.  Suppose  that  f € C([0,tQ],  Hs(Tk))  and  that  the 
operator  Lv  '(0)  defined  by  (4.2)  satisfies  the  hypotheses  of 
Theorem  4.4.  Suppose  u€  C([0,t0],  Hs(Tk))  is  a solution  of  (1.1) 
for  which  uT(0)  € Hs+1(Tk)  and  uX(0)  € Hs(Tk).  If  k = 1,  then 
uIX(0)  € HS+1(T)  and 

(4.47)  l|uII(0)  lls+1  < K(|un(0,0)|  + ||uI(0)||s+1  + ||uj(0)||a  + 11^(0)^) 

If  k > 1,  and  uXI  € HS(Tk_1),  then 

(4.48)  ||uXI(0)||s  < K( IIuq1)!  s k_x  + ||uI(0)||8+1  + l|uX(0)||s  + ||fI(0)||a) 

H (T  ) 

where  uXX  defined  on  T1^1  is  uXI(0)  restricted  to  the  plane 
xk  = °- 

4.10.  Example.  If  i = n-1,  and  for  some  i and  some  m < n-1,  the 
mn  element  of  the  matrix  Ax(x,0)  (which  we  will  call  a (x))  is  non- 
zero for  all  x,  then  the  hypotheses  of  Theorem  4.4  are  satisfied  for 
L^X?/(0),  and  we  can  apply  Theorem  4.9.  Assuming  i = k for  simplicity, 
let  r(x)  = (0,  ...  , 0,  l/a(x) ) . If  g = L^12)(0)  un(0),  then 
multiplying  this  equation  by  r(x)  yields 

(4.49)  \ un(0)+  Z r(x)  A(l2)(x,0)  b uI]:(0)  + r(x)  B(l2)  (x,0)uI:[(0) 

Xk  J-l  J XJ 

= r(x)g  , 

which,  for  k > 1,  is  a first  order  partial  differential  equation  which 
is  thus  clearLy  s-hyperbolic. 
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4.11.  Example.  The  method  of  Example  4.10  can  be  used  for  some  non- 
linear equations.  We  can  recover  the  geopotential  cp  from  the  winds 


u and  v in  the  shallow-water  equations  (1.5).  Using  the  first  equation 
of  (1.5),  we  can  recover  cp(x,y,0)  from  u(x,y,0),  v(x,y,0),  ut(x,y,0) 
and  cp(0,y,0).  Or,  using  the  second  equation  of  (1.5),  we  can  recover 
cp(x,y,  0)  from  u(x,y,0),  v(x,y,0),  vt(x,y,0)  and  <p(x,0,0). 

Methods  Using  No  Information  About  u^(0) 

s It 

4.12.  Theorem.  Suppose  that  f£C[(0,tQ],H  (T  ))  and  that  the  operator 

( TP  ^ 

Lv  '(0)  satisfies  the  hypotheses  of  Theorem  4.5  for  this  s.  Suppose 
u £ C([0,tQ],  HS(Tk))  is  a solution  of  (1.1)  for  which  uI(0)  £ HS+'L(Tk) 
and  u*(0)  £ Hs(Tk).  Then 

(4.50)  l|uII(0)||s  < K(||uI(0)||s+1  + ||uj(0)||g  + ||fI(0)  ||s) . 


2 s+1 

4.13.  Theorem.  Suppose  that  f £ L ((0,tQ),  H ),  that  the  system 

v.  = l/22^v  is  s-hyperbolic,  and  that  the  operator  l/^(t)  satisfies 

v 

the  hypotheses  of  Theorem  4.5  for  this  s and  for  t £ [0,t„].  Suppose 
u £ C([0,tQ],Hs)  is  a solution  of  (l.l)  for  which  u1  £ L2((0,t0),HS+2) 
and  u*  £ L2((0,tQ),Hs).  Then 


(**.51) 


|un(0)||  < 

o 


K(llullw,2  ♦ 


t s,2 


^8+1,2^  ' 
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Methods  Using  a Finite  Number  of  Fourier  Coefficients  of  u (0) 

4.l4.  Theorem.  Suppose  that  f € C([0,tg],He  and  that  the 

operator  Lv  '(0)  satisfies  the  hypotheses  of  Theorem  4.7.  Suppose 

u is  a solution  of  (l.l)  for  which  u*(0)  € HS  and  u*(0)  6 H8-1. 

II  s 

Then  u (0)  € H , and  there  is  a constant  M such  that 


(4.52)  ||uII(0)  ||  < K(  £ |u(§,0)  | + 11^(0)11+11^(0)  ||  +11^(0)11 

s |g|<M  s * S-L  s— 


where  u(5,t)  is  defined  by  (2.35). 


4.15.  Theorem.  Suppose  that  f € L2((0,t0),H8),  that  the  system 

v.  = l/22^v  is  s-hyperbolic,  and  that  the  operator  l/12^(t)  satisfies 
z 

the  hypotheses  of  Theorem  4.7  for  t £ [0,tg].  Suppose  u £ C([0,tg],Hs) 
is  a solution  of  (l.l)  for  which  u*  £ L2((0,tQ),H8+^).  Then  there 
is  a constant  M such  that 


<‘••53)  IIuii(o)II8<k(|5|Sm|uii({,o)|  + ll«I»^lj2*Kll..1(2*IM.jS,) 


4. 16.  Theorem.  Suppose  that  f £ C([0,tQ],H  ) and  that  the  operator 
(0)  satisfies  the  hypotheses  of  Theorem  4.8.  Suppose 
u £ C(  [0,tQ],H°)  is  a solution  of  (l.l)  for  which  u^O)  £ H1  and 
u^(0)  £ H^.  Then  there  is  a constant  M such  that 

(4.54)  ||un(0)||  <K(  £ |unU,0)|  + ||uI(0)||  + ||uf(0)|L+ 11^(0)11  ). 

0 |ft|<M  l t 0 0 
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CHAPTER  V 
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APPROACHES  WITHOUT  TIME  DERIVATIVES  OF  u1 

We  now  consider  recovering  uXX(0)  from  data  which  does  not 
involve  time  derivatives  of  uX.  As  we  saw  in  Chapter  III,  measuring 
uX(t)  at  several  time  levels  does  not  yield  the  continuous  dependence 
of  uXX(0)  on  the  measured  data,  but  we  may  have  continuous  dependence 
using  uX(t)  for  0 < t < tQ.  We  restrict  our  attention  here  to  strictly 
hyperbolic  equations  with  constant  coefficients.  Theorem  5.6  is  similar 
to  Corollary  3.9/  but  as  in  Chapter  IV,  the  linkage  conditions  are  more 
complex  than  in  the  two-by-two  example  of  Chapter  III.  We  also  investi- 
gate the  effects  of  lower  order  terms  on  the  data  necessary  to  recover 
uIX(0). 

We  use  slightly  different  norms  in  this  chapter;  the  main  tool 
in  our  approach  is  Theorem  5.5/  and  its  application  is  more  direct  in 
these  norms. 

5.1.  Definition.  For  any  s £ ]R , define 

(5.1)  |u|  = sup  k (1  + |||)s  |u(|)| 

|GZ 

MS(T^)  is  the  Banach  space  of  all  u £ d'(T^)  for  which  |u|  < °°. 

s 

The  norm  on  C([0,tQ],Ms)  is 
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r 

t 


(5.2) 


1 S , 00 


sup  | u | 

<>£‘£‘0 


We  remark  that  HS  C MS  for  all  s,  and  if  s.  > k/2  + s^ , then 
S1  S2 

M C H The  induced  inclusion  maps  are  continuous. 

We  now  present  several  lemmas  needed  in  the  proof  of  Theorem  5.6. 
It  is  well  known  (see  e.g.  Ortega  (1972))  that  for  n X n matrices  Q, 


(5.3) 


I^L  = max  £ k,l 

1 < i < n j=l  J 


The  following  lemma  appears  in  Gautschi  (1962), 


5.2.  Lemma.  Let  = V (x^, • • • > xn)  denote  the  Vandermonde  matrix 


(5k 


V = 
n 


1 

x. 


1 

X 


n-1 


n-1 


Then 


(5.5) 


det  V = II  (x  .-x. ) 

1< i < j < n J 


If  the  x^'s  are  distinct,  then  is  invertible  and 


(5.6) 


|V  'L|  < max 

1 n '<»  — 


1 + h1 

n — *L_ 


1 < i < n 1<  j < n ,xj  ‘ xil 

j/i 
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-V*  '*• 


T — 


■ ■ k- - 


- r - 

b * 


f 

• r > 


5.3.  Lemma.  Let  A^,  ...  , An  be  distinct  real  numbers,  and  define 


(5.7) 


iVt  iAt 

V(ti)  = Vn(e  1 1,  ...  , e nl) 


If  we  say  that  ) vCt^) = °°  when  V(t1)  is  singular,  we  have 


(5.8) 


0<t1<  [^/(n-!)] 


|v(t1)‘1L  < p(t0) 


where 


(5.9) 


P(tQ) 


inf  ( max  II  |sin  — (A.-A^)t1|  ) 

OCt.^  [tQ/(n-l)]  1<  i < n l<j<n 

3 i 


iA.t1 

Proof,  Follows  directly  from  Lemma  5.2  with  x..  = e J since 
" J 

1 + lx.  I = 2 and 
1 3' 

|Xj-xJ  = | exp [ i | (Aj-Ai)ti]  - exp[i  | | 

= 2 j sin  \ (AJ-Ai)t1| 

5.4.  Lemma.  Let  A^  ...,  An  be  distinct  real  numbers,  and  define 

(5.10)  6,  = min  | A -A  | , 6 = max  I A.-A  | 

1 l< i<  d <n  J 1 1< i < j < n • 


and  p = 52/51  > 1.  If  p(tQ)  is  given  by  (5.9)  and 
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(5.11) 


t#  = (n-1}  i • rh 


then  e(tQ)  <K'(tQ,  Br  &2)  < K"(tQ,  6^  &2)  where 


(5.12)  K’(t0,  61,  B2)  = 


( ■ 6it°  r 

\Sl" 

(\l-n 

sin  rh) 


if  0 < t < t 


if  tQ  > t 


(5.13)  K"(t0,  &v  B2)  = 


te)"1 


/i* 

V 2 


if  0 < tQ  < t 


if  tQ  > t 


= max 


(n-l)irY'1  h*  ^°'1 

. 6iV  ’ l 2 M 


Proof . t is  chosen  so  that 


and 

(5.14) 


r,  ^ 1 R t 7T  1 _ t 

0 2 61  n-1  - 2 - 2 62  n-1  7r 


* * 
1 c t . 1 _ t 

sln  2 51  ■ Sln  2 62 


Since 


is  4 


t r 


, 1 c t 

and  — 5„ 


jrp_ 


2 1 n-1  1 + p “““  2 2 n-1  _ 1 + o 


(5.14)  holds  because  7rp/(l+p)  = 1 - (tt/(1+p)). 


ft  I 

' • . . j 
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,/Ht 


r 


If  0 < tg  < t , choose  t^  = t^/Cii-l).  Then  for  j ^ i, 
2 ^1  n-T  — \ 1 YNlK  - \ 62  HIT  = r T~o 


Since  tQ  < t , 


1 R "O  ^ 1 c t 7T 

2 1 n-1  - 2 &1  n^T  ~ TT~S 


so  the  minimum  of  sin  x for 


[i  6 ^ 

|_2  bl  n-1  ’ 1 + p 


is  taken  on  at  the  left  endpoint  of  this  interval.  So 

lsU,(k  (VWI  " si“(r  lyW  > 2^15 


and  hence 


B(t0) 


- (sin  al^iy) 


!f  tQ  > t*,  choose  tx  = t*/(n-l) . Since 


±8  J£  = _21_ 

2 1 n-1  l + p 


the  same  argument  shows  that 


e(V  s (sl"  rr ~J  ° • 


The  fact  that  sin  x < (2/7 r)x  for  x £ [0,  tt/2]  implies 
that  K' (tQ,  5^,  62)  < K"(tQ,  6^,  &2).  This  proves  Lemma  5.^. 


5.5.  Theorem.  Consider  the  system  of  ordinary  differential  equations 


(5.1^) 


yt  = iAy 


where  y = y(t)  € (En  for  t > 0 and  A is  a complex-valued  n x n 
matrix  with  distinct  real  eigenvalues  A^, ...,An.  Let 


r . = (r  , ...  , r . ) 
J Jl  jn' 


denote  an  eigenvector  of  A belonging  to  A . Let 

J 


P = [rx,  ...  , rQ] 


denote  the  matrix  whose  columns  are  r^,  ...  , r^.  If  r“  ^ 0 
for  j = 1, . . . , n,  we  have  for  each  tn  > 0, 


(5.15)  |y(0)l< 


P(tQ)  sup  |y  (t) 

1 < j < n J 


where  ?(tQ)  is  given  by  (5.9). 


Proof.  Let  D = diag(A1, . . . , A ),  the  diagonal  matrix  with  A^ ...,An 
on  the  diagonal.  Then  P ^AP  = D and  PDP  ^ = A.  The  solution  of 


(5.1*0  is 


y(t)  = PeiDt  P”1  y(0) 


MMb 


Let  w = P y(0).  Then  for  1 < m < £,  consider  that 


iX t iX  t 

ym(t)  = (rlm’  •••  ' rnm)  dia«(e  ’ •••  * e “ )w 


iX  t iX  t 

= (e  , , e n ) diagfr^,  ...  , rj» 


If  we  define 


y(m)(tl}  = (ym(0) ’ ya(V’-‘" 


then  for  0 < t.,  < tn(n-l)  we  have 


y(m)(V  = V(tl)  diae(rLn’  ’ rnm)w 


where  V(t1)  is  defined  by  (5.7).  Hence  if  V(t1)"'1  exists, 
dla«(rlm r„m>“  ■T<ti>"1  • 


< |v(t1)'  | sup  |y  (t) | 
o < t < tQ  m 


Taking  the  infimum  over  all  such  t^, 


w I < [ i 

a ^ '0<t.  < 


inf  |V(t  )-1|  ) sup  |y  (t) ) . 

0<ti-Cto/(n"1)]  ”/°<t<to  m 


By  Lemma  5.3> 


< p(t  ) sup  |yI(t)| 
o < t < 


Taking  the  maximum  over  m for  1 < m < &, 

|rjL  |wjl  i B(V  „ T*.  lyI(t>L 

U X>  ^ T>.~ 


Hence 


w < 
1 1 00  — 


3(t  ) sup 

0 < t < t0 


yI(t) 


min  I r^ I 
l<j<n  J “ 


and  the  theorem  follows  since  |y(0)|  < |p|  • w|  . 

1 1 oo  — 1 'oo  1 00 


Equations  Without  Lower  Order  Terms 
5.6.  Theorem.  Consider  the  system 

k 

(5.16)  u.  = £ A.u 

j=l  3 j 

where  the  A.'s  are  constant  real-valued  n X n matrices.  Suppose 

d 

(5.16)  is  strictly  hyperbolic,  i.e.,  suppose  that  the  matrix 

k 

(5.17)  A(|)  = £ A 

3=1  J J 

has  distinct  real  eigenvalues  A^t)  > AgCl)  > •••  > A^Ct)  for  all 
non-zero  5 € ]R  . Let  r^Cg),  ...  , rn(5)  denote  corresponding 
normalized  eigenvectors.  Assume  that  for  each  o>  € K with  joj|  = 1, 
we  have 
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and  let  s be  a given  real  number.  If  u 


solution  of  (5.16),  then 


where  u(|,t)  is  given  by  (2.35)  and  K(tQ)  is  a constant,  depending 

l™n 

on  tA.  K(t. ) behaves  like  t as  tA  -*  0 and  is  constant  for 


Proof.  Define  for  to  G Ik  with  |cjd|  = 1 


Since  A(|)  has  distinct  eigenvalues,  the  eigenvalues  and  normalized 
eigenvectors  of  A(co)  for  |co|  = 1 can  be  chosen  to  be  continuous 


locally.  See  Theorems  3.1.2  and  3.1.3  in  Ortega  (1972).  Since  the 


unit  sphere  in  ® is  compact,  the  following  three  numbers  are 


positive  and  finite 


(5.21) 


^ = inf  5 (to),  H = sup  6 (o>) 

1 M-i  1 2 M-i  2 

Suppose  | £ Z and  1^0.  Let  o>  = |/|||.  By  the  equation 
(5.16)  and  remarks  as  is  the  proof  of  Lemma  3.5, 

^ u(|,t)  = 27Ti  A(|)  u(|,t)  = i 2tt|||  A (u>)  u(|,t)  . 


The  eigenvalue^ of  27t|  i j A(co)  are  27r|||  A-^ou),  ...  , 2tt|  S | An(co). 

The  and  6 2 in  Lemma  5.^  for  these  numbers  are  |||  61(ai) 

and  |||  62(cu).  Let  K"(t0,  6^,  &2)  be  given  by  (5.13).  Since  K" 
clearly  increases  if  either  6^  decreases  or  &2  increases, 

K"(t0,  |||  &1(o5),  |||  &2U))  < K"(t0,  lll^,  |||n2).  Also,  note  that 
if  and  &2  decrease  with  p = &2/s^  remaining  constant,  K" 

increases.  Hence  K"(t0,  (l)^,  |g  |n2)  < K"(t0,  Ml,  ^ . Let  P(o>) 
be  the  matrix  whose  columns  are  r^co),  ...  , rn(^).  Clearly, 

|P(o>)  1^  < n.  By  Theorem  5-5  and  Lemma  5.*+, 

| P(oj)  J 

|u(5,0)|ro<  ~Y — K"(t0,  III  B (oj),  1 1 |B  (oi) ) sup  |uI(|,t)|00 

min  |r*(®)L  0 1 2 0<t<t_ 

l<j<n  3 ~ u 

- t MP)  SUP  |uI(!,t)| 

K 1 ^ 0<t<tQ 


So 

(5.22)  |nIJ( |,0)  | < \/r£I  1^(1, 0)L 

< K(t  ) sup  |uI(|,t)| 

^ r\  ^ 4.  ^ 4. 
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where 


K(t0)  =^pK"(t0,  Pl,  pg). 

By  (5.13),  K(tQ)  Behaves  like  t^-11  as  tQ  -*  0 and  is  constant  for 
large  tQ. 

Since  (5-22)  holds  for  each  £ ^ 0 in  Z , we  have 
!uI;C(0)|s  = su^.  (1  + |||)S  |uII(|,0)  | 

< |ui:E(0,0)|  + K(t0)  sup.  (1  + J||)s  sup  |uI(?,t)| 

££Z  0<t<tQ 

= |uXI(0,0) | + K(tn)  sup  lu1 J 

0<t<tQ  S 

and  the  theorem  follows. 

We  remark  that  the  dependence  of  K(t^)  on  tQ  indicates 

that  up  to  a certain  point,  the  longer  the  time  interval  over  which 

we  measure  u^,  the  more  accurately  we  can  expect  to  determine  u^(0). 

Note  that  K(tQ)  must  go  to  °°  as  t^  -» Oj  otherwise,  u**(0)  would 
I ~II 

he  determined  hy  u (0)  and  u (0,0). 

Restriction  (5.18)  is  necessary  for  a dense  set  of  cu's  in 
k I 

the  unit  sphere  of  ]R  . Suppose  =0  for  some  non-zero  11  in 

k I 

Z and  some  j;  this  holds  if  and  only  if  r (a>)  = 0 for 

J 

'u  = rj / 1 tj  } . If  <p(x)  is  any  scalar-valued  function  such  that 
$(?)  = 0 unless  | is  a non-zero  integral  multiple  of  tj,  and 
v^(x)  = p(x)  rj(Tl)>  hhen  the  solution  of  system  (5.16)  with  initial 
conditions 


8k 


(5.23) 


u(0)  = 0 , 


uII(0)  = v„ 


satisfies  u^(t)  = 0 and  u^(0,0)  = 0.  Hence  uII(0)  cannot  be 
determined  from  u^(t)  for  0 < t < t^  and  u^(0,0)  in  this  case. 


5.7.  Example.  We  present  sample  conditions  under  which  condition 

(5.18)  is  satisfied.  Suppose  l = n-1  and  k < n-1.  Let  g.  be 

<3 

the  last  column  of  A..  If  g*,  ...  , g*  are  linearly  independent, 

J 1 K 

then  (5.18)  is  satisfied.  The  linear  independence  of  gp  ...  , g* 
implies  that  the  last  column  of  A(|)  with  its  n-th  component 
removed  is  not  zero  for  all  | / 0 . in  E , and  hence  (0, . ..,0,1)' 
is  not  an  eigenvector  of  A(|). 

In  general,  if  A.  ' denotes  the  upper  right  l x (n-i) 

V 

( PP) 

block  of  A.  and  A.  ' denotes  the  lower  right  (n-i)  x (n-i) 

0 0 

block  of  A.,  then  (5.18)  is  satisfied  if  and  only  if  for  all  1^0 


k (22)/  r»k  (22) 

in  ]R  , every  non-zero  eigenvector  of  Av  '(f)  = Z._,A.  . is 

J J <3 

not  in  the  kernel  of  A^2^(|)  = a(*^|  .. 


Equations  with  Lower  Order  Terms 

Theorem  5.6  no  longer  holds  in  the  same  form  if  there  are  lower 
order  terms  in  the  equation,  as  the  following  example  illustrates. 
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with  k = 1 and  i.  = 1.  The  eigenvalues  of  A are  0 and 


+ \J2,  so  (5.2 4)  is  strictly  hyperbolic.  In  the  notation  just  above, 

the  kernel  of  A = (l>  0)  is  spanned  by  (0,  1)',  which  is  not  an 

(22) 

eigenvector  of  A , so  the  hypotheses  of  Theorem  5*6  are  satisfied 

for  the  equation  ut  = Aux<  However,  it  is  easy  to  see  that 

u(x,t)  = (0,  cos  2tt x,  sin  2ttx)  ' 

is  a solution  of  (5.24)  for  which  uI(t)  2 0 and  ui:E(0,0)  = 0. 

We  can  still  obtain  the  continuous  dependence  of  uII(0) 
on  u (t)  for  0 < t < t^  and  the  necessary  measurements  of  u^(0). 
The  difference  is  that  we  have  to  measure  more  than  just  the  | = 0 
Fourier  coefficient  of  u (0)j  a finite  number  of  Fourier  coefficients 
will  suffice. 

5.9.  Theorem.  Consider  the  equation 

k 

(5.25)  u,  = Z A u + Bu 

* j-i  Jxj 
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where  B is  a constant  real -valued  matrix.  Suppose  that  the  correspond- 
ing system  (5.16)  with  B = 0 satisfies  the  hypotheses  of  Theorem  5.6. 
Then  there  is  a constant  M,  independent  of  s and  tQ,  and  a constant 
K depending  on  s and  tQ  such  that  if  u € C([0,t0],  Ms)  is  a 
solution  of  (5.25),  then 

(5.26)  |uri(0)|  < max  (1  + |||)s  |un(i,0)|  + K|uX| 

|||<M  s'°° 

Proof.  Define  A(|)  by  (5.17).  Suppose  | £ Zk  and  | ^ 0.  Let 
a)  = i/\i  |.  By  the  equation  (5.25), 

(5.27)  ^£uU,t)  = (2rriA(i)  + B)  u(|,t) 

= i 2tt||  I (a(o>)  + 

The  eigenvalues  A (o>,  |^|)  of  A(oj)  + B/ (2vri | ^ | ) need  no  longer 
J 

be  real,  but  if  |||  is  large  enough,  their  real  parts  will  be 
distinct  and  uniformly  separated  and  their  imaginary  parts  will  be 
uniformly  bounded. 

If  z.,  Zg  £ I and  |lm(z  )|  <1  for  j = 1,2,  then 
-1  I *z-j| 

e < le  I < e and  it  follows  easily  that 

_!|  iRe(z1)  iRe(z  )i  1 iz  iz  . 1 iRe(z  ) iRe(z  ) 

e I e - e | < |e  - e | < e|e  1 - e d 


So  Lemma  5.5  carries  over  to  complex  numbers  A, , . . . , \ if 

1 ’ n 

I Im(A.) | <1  provided  that  Re (A  ) , ...  , Re(A  ) are  distinct, 


A.  - A.  is  replaced  by  Re(A.  - A.)  in  (5.9) > and  P(t  ) is  rePlaced 
J 1 J » 

by  (2e)n  1 P (tQ)  in  (5.8). 

Because  of  the  compactness  of  the  unit  sphere  in  1R  and 

the  continuity  of  the  eigenvalues  and  local  continuity  of  the  normalized 

eigenvectors  rj(oo,  |||)  (for  |||  large  enough  to  make  the  eigenvalues 

A.(co,  |||)  of  A(cu)  + B/(27ri|)  distinct),  there  is  an  M such  that 

if  |||  > M and  cu  is  in  the  unit  sphere,  then  |lm(A.(o>,  |||)  <1, 

J 

the  real  parts  of  A^(cd,  |l|)  are  distinct,  there  are  uniform  positive 

lower  and  upper  bounds  on  |Re(A..(a>,  |||)  - A^co,  |||))|  for  j / i, 

and  there  is  a uniform  positive  lower  bound  on  |r^(co,  |l|)|  . The 

J 

proof  of  Theorem  5.5  goes  through,  and  the  theorem  follows  as  in  the 
proof  of  Theorem  5.6,  applying  the  analogue  of  Theorem  5.5  to  (5-27) 
for  | | | > M. 

The  Effect  of  the  Coriolis  Term  on  the  Linearized  Shallow-water  Equations 
We  consider  the  application  of  the  methods  of  this  chapter 
to  the  linearized  shallow-water  equations 

\ + Vx  + v0uy  + cpx  - fu  = 0 ) 

(5.28)  vt  + uQvx  + vQvy  + c?y  + f u = 0 

*Pt  + + Vy}  + Vx  + Vy  = 0 

where  u^,  vQ,  cpQ,  and  f are  constants.  From  physical  considerations, 
we  assume  that  cp^  > 0.  As  we  noted  in  Chapter  I,  simulation  experiments 

I 

i 
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indicate  that  the  wind  field  does  not  adjust  to  the  mass  field  in  the 
tropics  when  intermittent  updating  is  used,  but  adjustment  in  mid- 
latitudes does  occur.  We  can  explain  this  effect  for  the  linearize! 
system  (5.28).  In  the  original  system  (1.5),  the  Coriolis  parameter 
f is  0 at  the  equator  and  relatively  large  in  absolute  value  in  the 
mid-latitudes.  We  examine  the  effect  that  the  size  of  |f|  has  on 
the  possibility  of  recovering  the  winds  u(x,y, 0)  and  v(x,y,0)  from 
the  geopotential  cp(x,y,t)  for  0 < t < tQ  end  a finite  number  of 
Fourier  coefficients  of  the  winds  at  time  t = 0. 

If  f = 0,  then  u(x,y,0)  and  v(x,y,0)  are  not  uniquely 
determined  by  cp(x,y, t)  for  0 < t < tQ  and  a finite  number  of 
Fourier  coefficients  of  u and  v at  time  t = 0.  For  system  (5.28), 
the  matrix  A(|)  defined  by  (5.17)  is 


A(|)  = 


Vl  + V062 


I-  ^0^1 


u0^1  + v0^2 


"0*1  + v2 


For  each  § £ Zk,  the  vector  r(|)  = (-|  , § , 0)'  is  an  eigenvector 
of  A(|),  and  the  component  of  r(|)  corresponding  to  cp  is  0. 

As  in  the  remark  following  Theorem  5.6,  this  implies  that  an  infinite 
number  of  the  Fourier  coefficients  of  the  winds  u and  v at  time 
t = 0 cannot  be  determined  by  cp(x,y,t)  for  0 < t < tQ. 

If  f ^ 0,  however,  then  u(x,y, 0)  and  v(x,y, 0)  are  uniquely 
determined  by  <p(x,y,t)  for  0 < t < tQ  and  the  (0,0)  Fourier 
coefficient  of  u(x,y,0)  and  v(x,y,0). 


* 


k 


t 


5.10.  Theorem.  Consider  the  system  (5.28)  with  cp^  > 0 and 
f / 0.  Given  s and  t 0,  there  is  a constant  K such  that  if 
z = (u,v,cp)'  E C([0,tQ],  Ms)  is  a solution  of  (5.28)  for  which 
cp  £ C([0,t0L  Ms  x),  then 

(5.29)  |w(0)|s  < | w(OjOjO)  | + K|cp|s+1^ 

where  w = (u,v)'  and  w(|^,l2Jt)  is  defined  by  (2.35). 


Proof.  Setting  cQ  = ^/p~  , dividing  the  third  equation  of  (5.28) 
by  c0,  and  replacing  cp  by  c^cp,  we  obtain  the  system 


(5.30) 


1 

0 

Vi 

O 

1  

u 

+ 

-f  0 0 

V 

_ 0 0 0 m 

. . 

Clearly  it  suffices  to  prove  (5.29)  for  the  system  (5.30).  If 
z = (u,v,cp)'  £ C ( [ 0 , tQ ] , Ms)  is  a solution  of  (5.50),  then 

(5.3D  ^ z(fc,t)  * i Q(|)  z(|,t) 

where 


90 


j 


i 


(5.32) 


Q(6)  = 


-r(i) 


-f 


[_2Tricol  x 


-if  -2iric06^  ‘ 

-r(§)  -27ric0l2 

-27ric0l2  -r(l)  . 


where  r(l)  = 27t(Uq| x + vQS2).  As  in  Elvius  and  Sundstrom  (1973), 
the  eigenvalues  of  Q(|)  are 


(5.33) 


\(i)  = -r(t) 

\>(0  = -r(  t)  + a(i) 
^(l)  = -r(i)  - a(l) 


where  a(|)  = (4tt2c2|||2  + f2)1^2.  The  corresponding  eigenvectors 
for  |/0  are 


rx(l)  =27rc0|||2  (-2iric0l2,  Siric^,  f) 

(5.34)  r2(|)  = f(i|2f  - ifil),  -Uxf  - f2aU),  2ttc0|||2)' 

r3(|)  = f(i?2f  + 5^(5),  -ilLf  + l2a(l)»  2ttc0 1 6 1 2 ) ’ 


Note  that  since  f / 0,  ct(|)  > 0,  so  the  eigenvalues  of  Q(|)  are 
distinct  and  real  for  all  l . 

To  obtain  the  winds  from  the  geopotential,  we  want  r*  to 
denote  the  third  component  of  r^  and  r ^ to  denote  the  first 

2 

two  components  of  r..  Suppose  £ ^ 0*  The  scaling  factors  2ttCq|5| 

J ^ 

and  f appearing  in  (5.34)  were  chosen  to  equalize  l^l,*  for 
j = 1,2,3.  We  have 
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(5.35) 


Cl 


(5.35)  min  |r*(|)|  = |2ttc  f|||2| 

1<  j <3  J 

Let  P(|)  = [r^(i)>  r2(0>  rj(l)]  denote  the  matrix  whose  columns  are 
rx(5)»  ^2(|),  r^(| ) . By  (5.3),  !?({)!„  is  its  maximum  row  sum,  where 
a row  sum  is  the  sum  of  the  absolute  values  of  the  elements  in  that  row. 
The  first  row  sum  of  P(|)  is  bounded  by 

W2c2|||2||2|  + 2(W2c2|||2||1|2f2  + |||2f4)l/2 

<47r2c2|||2  |s2|  +2(2ttc0|||  IiJ  |f|  + ||  If2) 

< W2c2|||5  + l+7rc0 U | 2 |f|  + 2 1 1 1 f2 

< M5l0rco|i|  + |f|)2 

Similarly,  the  second  row  sum  of  P(|)  is  also  bounded  by 
MsKttCqIsI  + |f|)2.  The  third  row  sum  of  P(|)  is  6rrc0|t|2  |f|, 
which  is  also  bounded  by  4|||(ttc0|||  + | f j ) 2 since  £ / 0.  Hence 


(5.36) 


lp(i ) L 


min  | r^ ( | ) | 
1<  j<3 


dU#l«» 


Now 


t (ire0|ft  | + |f|)‘ 
27TC0  |f|  |i|2 

+ lfD2 

“ TTC0|f|  I1! 


\\U)  - >2(t)|  = |x1(ft)  - >3(1)1  = \ |x2(6)  - >3(1)  I - a(e) 


For  |i|  large  enough,  a(f)  [tn/(n-l)]  > 2tt/3,  bo  if  t ({)  * (2tt/3)  a(j) 


f then  0 < tx  < tQ/(n-l)  and  sin(|  a(£)  t^O)  = sin(a(|)  t1(0)  = v^/2. 

Ill  f 

Hence  the  3(tQ)'  s (depending  on  g)  defined  by  (5.9)  are  bounded 
<v  independent  of  I / 0.  Applying  Theorem  5.5  to  equation  (5.31), 

we  obtain 


(5.37) 


|z(£,0) | < const | i | 


°<t<t0 


|5>(l,t) 


The  theorem  follows  immediately. 

Note  that  by  (5-36),  the  constant  K in  (5.29)  goes  to  “ 


CHAPTER  VI 


COMPUTATIONAL  METHODS 


In  this  chapter,  we  consider  question  (l.U)  for  the  sample 
equation  of  Chapter  III 


(5.1) 


u.  = Au 
t x 


where  u = u(x,t)  = (u^u^)'  is  periodic  in  x,  x£]R,0<t<  t^, 


[::] 


is  a constant  real-valued  matrix,  and  u^  is  the  more  completely 
observable  con^ionent.  We  discuss  computational  methods  for  the  recovery 
of  Ug(x,0)  using  the  theoretical  results  of  Chapter  III.  The  results 
of  our  computations  are  in  close  agreement  with  the  theory  of  the 
equations  which  has  been  developed. 

In  our  computations,  we  use  the  test  equation 


(6.1) 


-1  -5 


-5  -1 


The  matrix  A in  (6.1)  is  chosen  to  model  the  signs  and  relative 
sizes  of  the  elements  of  the  matrix  in  the  linearized  shallow-water 
equations  for  one-dimensional  flow 
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(6.2) 


v 


ix 


See  equation  (5.30). 


Methods  Using  Time  Derivatives  of  u1 

By  Theorem  3.1,  u2(x,0)  can  be  determined  uniquely  from  and 
depends  continuously  on  the  data  u^XjO),  d^u^^O),  and  u2(0,0) 
if  b / 0.  We  assume  that  this  data  is  available.  The  first  equation 
of  the  system  (3.1)  is  an  ordinary  differential  equation  for  u2(x,0) 

(3.2)  a,u2 . ± <Vi  * oSxui) 

and  the  initial  data  u2(0,0)  for  this  equation  is  available.  So  we 
can  solve  (3.2)  numerically  to  obtain  an  approximation  to  u2(x,0). 

Although  c^u-j^  appears  explicitly  in  (3.2),  we  can  solve  (3.2) 
without  measuring  or  computing  an  approximation  to  d^u^.  If 

w(x)  = u2(x,0)  + ^ u^x.O),  then  (3.2)  becomes 

(6.3)  g = £ atUi(x,°) 

We  solve  (6.3)  numerically  for  w,  and  then  compute  u2(x,0)  from 
w(x)  and  u^(x,0).  Using  the  notation  of  Henrici  (1962)  for  linear 
multistep  methods,  we  solve  (6.3)  approximately  by  the  leap-frog 
method 
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-mv'" 


7 . 


»1 


(6.4) 


yn+2  " yn  “ 2hfn+l 


where  h is  the  step  length  in  the  x direction.  We  use  Euler' s 
method 

(6.5)  yi  “ y0  = W0 

to  generate  the  extra  starting  value  y^.  The  method  (6.4)  is  of 
order  2 ; Euler's  method  is  of  order  1,  so  the  error  made  in  using  it 

O 

once  to  generate  y1  is  (9(h  ) . 

Once  we  have  constructed  complete  initial  data  at  t = 0,  we 
can  solve  (3.1)  numerically  using  standard  difference  methods.  We 
use  the  leap-frog  scheme 

(6.6)  u(x,  t + k)  = u(x,  t - k)  + £ A(u(x  + h,  t)  - u(x  - h,t)) 

where  k is  the  step  length  in  the  t direction.  We  use  periodic 
boundary  conditions  u(0,t)  = u(l,t),  and  forward  differencing  in 
time 


(6.7)  u(x,k)  = u(x,0)  + ^ A(u(x  + h,  t)  - u(x  - h,  t)) 

to  get  started.  This  method  is  also  of  order  2. 

Since  this  approach  is  essentially  the  combination  of  two 
well-posed  problems,  we  expect  no  difficulties  in  the  computations, 
and  none  arose  in  our  test  computations.  We  applied  this  method  to 
equation  (6. 1)  with 


9 6 


u]L(x,0)  = 10  + sin(2mc) 
d^^x^O)  = - sin(2Trx) 


u2(0,0)  = 0 


At  each  time  level,  the  L norm  of  the  actual  solution  is  10.1. 

2 

The  -t  norms  of  the  errors  in  the  computed  solution  are 


II 

* 

-IS 

II 

A 

1 

250 

X 

-[S 

II 

x: 

u2(*,0) 

.00063 

. 00016 

U (-,h 

.15 

.037 

u (*>1) 

.29 

.073 

1 

500 


(where  the  £ norm  of  a function  defined  on  a grid  is  the  square 

root  of  the  average  of  the  squares  of  its  values  at  the  grid  points). 

Notice  that  dividing  h and  k by  2 divides  the  error  by  4,  as  is 

expected  of  second  order  methods.  We  also  ran  a test  where  we  used 

u^(x,k)  and  u^(x,-k)  as  data  to  approximate  u^(x,0)  and  d^u^(x,0). 
2 

The  l errors  were  virtually  identical  to  those  above. 

It  is  clear  that  the  success  of  this  approach  does  not  depend 
on  the  particular  choice  of  numerical  methods  to  solve  the  ordinary 
differential  equation  (3.2)  and  the  partial  differential  equation  (3.1). 
This  approach  also  extends  readily  to  the  situation  in  Theorem  4.9. 


Methods  Us ins  u1  at  Several  Time  Levels 

For  simplicity,  we  assume  that  u^(x, 0)  = 0.  If  not  and 
u^CxjO)  = v1(x),  then  we  can  subtract  off  from  u the  solution  of 
(3.1)  with  initial  conditions 

Uj  (x,0)  = vx(x)  , u2(x,0)  = 0 

and  proceed  as  if  the  original  u-^XjO)  were  0. 

We  first  consider  trying  to  recover  u2(x,0)  by  measuring 
u^CxjO)  (which  we  assume  is  0),  u^x,^),  and  u2(0,0)  where 
t^  € (0,tQ]  and  u(?,t)  is  defined  by  (2.35).  By  Theorem  3.6,  we 
know  that  uniqueness  depends  on  the  irrationality  of  t^d  where 

(3.12)  d = ((a  - c)2  + 4b2)  ~^2 

and  even  if  we  have  uniqueness,  u2(x,0)  does  not  depend  continuously 
on  this  data.  However,  if  Ug(x,0)  has  a finite  Fourier  expansion, 
or  if  we  want  to  recover  only  a fixed  finite  number  of  the  Fourier 
components  of  u2(x,0),  we  can  consider  using  data  of  this  form  if 
we  make  a judicious  choice  of  t^  in  view  of  Lemma  3.5>  which  gives 

(6.8)  uL(|,t)  = i e7ri|t(a+c)  sin(7r|td)  u2(|,0) 

We  performed  an  experiment  to  compute  u2(x,0)  from  the  data 
u^x,^)  and  u2(0,0)  under  the  assumption  that  u^(x,0)  = 0.  We 
use  the  measurement  of  u2(0,0)  to  determine  the  constant  term  of 
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u2(x,0).  The  method  we  use  follows. 

Form  the  matrix  E (which  depends  on  t^)  which  maps  the 
initial  conditions  in  discrete  form  into  an  approximation  of  u^Xjt^) 
in  discrete  form  using  leap-frog.  Setting 


(6.9) 


V = (u2(h,0),  u2(2h,0),  ...  , u2(l,0))’ 

U = U(tL)  = (uL(h,tx),  u]_(2h,t1),  ...  , u^l,^))’ 


the  j-th  column  of  E is  the  computed  value  of  U using  leap-frog 
(with  forward  differencing  to  get  started)  with  initial  data 
u1(x,0)  = 0 and  V = e^  = (0,  . . . , 1,  . . . , 0) ' , the  1 occurring 
in  the  j-th  place.  We  want  to  solve 


(6.10) 


EV  = U 


for  V given  U.  If  n = l/h,  E is  an  nxn  matrix.  But  E is 
singular:  E(l,  1,  ...  , l)'  = 0.  If  n is  even, 

E(l,  0,  1,  0,  ...  , 1,  0)'  = E(0,  1,  0,  1,  ...  , 0,  1)'  = 0 


If  n is  odd,  we  find  experimentally  that  the  rank  of  E is  n-1, 
and  that  the  (n-1)  x (n-1)  matrix  obtained  by  deleting  the  last  row 
and  column  of  E is  nonsingular.  We  delete  the  last  column  of  E 
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\ r 


Or  * 


(i.e.  we  move  it  to  *he  other  side  of  the  equation)  by  using  our  extra 


f 

t 

*VV 

piece  of  information  (the  measured  value  of  u2(0,0)  = u2(l,0)).  Then 

V, 

ignoring  the  n-th  equation  (i.e. the  last  row  of  E),  we  solve  the 
resulting  nonsingular  (n-l)  x (n-l)  linear  system  by  Gaussian  Elimination. 

We  applied  this  method  to  the  equation  (6.1)  (for  which 
d = 10)  with  the  actual  u2(x,0)  being  1 - cos(4toc)  for  many  values 
of  t^,  with  h = l/l9  and  k = I/38O.  The  error  in  the  computed 
u2(x,0)  behaves  as  we  would  expect  from  (6.8):  if  | sin(7rd|t^)  | 
with  |=2  is  close  to  0,  the  error  is  large  (for  t^  = 20/380,  the 
relative  error  is  4.0)  and  if  | sin(-rr|t1d)  | is  close  to  1, 

the  error  is  small  (for  t^  = 9/3^0,  the  relative  l error  is  .016). 

, We  applied  this  method  to  equation  (6.1)  with  the  actual 

u2(x,0)  having  other  single  nonzero  Fourier  coefficients.  There  were 

2 

similar  relationships  between  the  relative  l error  in  the  computed 

u2(x,0)  and  the  size  of  | sin(7r|t1  d)  | . Also,  if  the  actual  u2(x,0) 

2 

had  several  nonzero  Fourier  coefficients,  the  relative  -t  error  in 
the  computed  u2(x,0)  was  small  only  when  | sin(7r|t^d) | was  not  close 
to  0 for  all  |*s  with  nonzero  Fourier  coefficients. 

We  now  consider  trying  to  recover  u2(x,0)  by  measuring 
u1(x,0)  (which  we  assume  is  0),  u^x,^),  ...  , u^(x,t  ),  and 
uo(0,0)  where  0 < t,  < •••  < t < t-.  Remarks  similar  to  those 
above  concerning  uniqueness  and  continuous  dependence  apply  here  as  well, 
but  we  have  more  parameters  them  just  t^  at  our  disposal. 


We  performed  an  experiment  to  compute  u2(x,0)  from  the 
data  u^(x, t^),  ...  , u^(x,tm),  and  u2(0,0)  under  the  assumption 


— 1 — 1 — ■ ■ — ’ • - — - 

t 

that  u^(x,0)  = 0.  Using  the  notation  introduced  above,  we  want  to 
solve  the  overdetermined  system 


i 

w 


*E(t1)  ' 

'u(tx)  ' 

' 

(6.11) 

E(t2) 

V = 

u(t2) 

1 

.E<V  - 

-U<V  - 

for  V given  U(t^),  ...  , U(tm).  If  n is  odd,  again  we  found 
experimentally  that  the  rank  of  the  mn  X n matrix  in  (6.11)  is  n-1. 

We  want  to  solve  (6.11)  in  the  least  squares  sense.  To  do  this,  ve 
compute  the  singular  value  decomposition  of  this  matrix  using  the 
Golub-Reinsch  algorithm,  and  then  apply  the  pseudo- inverse  of  this 
matrix  to  find  the  least  squares  solution  of  (6.11)  with  minimal 
norm.  See  Golub  and  Reinsch  (1971)  and  Dahlquist,  Bjorck,  and  Andersen 
(197M  for  discussions  of  the  singular  value  decomposition,  pseudo- 
inverses, and  linear  least  squares  problems.  We  then  add  the  same 
constant  to  each  element  of  V to  obtain  the  correct  uo(0,0). 

We  applied  this  method  to  the  equation  (6.1)  with  the  actual 
u2(x,0)  being 

(6.12)  Ug(x,0)  =6-3  cos(2t rx)  - 2 cos(birx)  - cos(6t7x) 

and  with  h = l/l9  and  k = l/380.  The  norm  of  u2(x,0)  is 

= 6.6.  The  £ errors  of  the  computed  approximations  to  u2(x,0) 
for  several  test  cases  were: 
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Intermittent  Updating 


We  consider  the  method  of  updating  to  obtain  for  the 

equation  (3.1)  in  this  section.  We  assume  that  b ^ 0.  The  results 
in  Chapter  III  give  us  hope  that  we  have  enough  data  to  determine  u^ 
if  we  know  the  constant  term  of  u2  and  if  we  know  u^  for  a sufficiently 
dense  set  of  times  to  meet  our  accuracy  requirements.  The  question  here 
is  how  the  method  of  intermittent  updating  makes  use  of  this  data. 

Since  equation  (3.1)  is  reversible  in  time  and  since  it  has  constant 
coefficients,  we  consider  updating  which  only  moves  forward  in  time; 
our  results  can  easily  be  modified  to  handle  the  case  of  integrating 
forward  and  backward  in  time. 

Suppose  u(x,t)  is  a solution  of  the  equation  (3.1).  Let 
g(x)  be  some  initial  guess  for  u2(x,0).  Define  the  function 
v = v(x,t)  = (v^Vg)'  by:  for  0 < t < t,  v(x,t)  is  the  solution 
of  (3.1)  with  initial  conditions  v^(x,0)  = u^(x,0)  and  v2(x,0)  = g(x); 
inductively  for  jr  < t < (J+1)t,  v(x,t)  is  the  solution  of  (3.1) 
with  initial  conditions  v^(x, jc)  = u^(x,  Jt)  and  v2(x,Jt)  is 
obtained  from  the  previous  interval.  For  now,  t is  a fixed  positive 
number  which  we  call  the  frequency  of  updating.  If  u^(x,t)  is 
measured  at  jr  for  j = 0,1,2,...  and  we  have  an  initial  guess 
g(x)  for  u2(x, t),  then  v(x,t)  can  be  approximated  by  finite 
difference  methods  in  the  obvious  way:  while  proceeding  with  the 
numerical  Integration,  replace  the  computed  v^(x,t)  by  u^(x,t) 
at  the  times  when  u1  has  been  measured. 

Let  t^  = Jr  for  some  J.  Let 
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Then  (f^x),  g^x)  ) 
initial  data  u, (x.t 


is  the  solution  at  t = t.  + t 


so  by  Lemma  3.5 


j-  sin(Tr|Td)  g^({) 


Now,  it  is  easy  to  show  that  for  solutions  u 


is  independent  of  t for  each 


Combining  (6.13),  (6.l4),  (6.15) 


and  Parseval' s relation,  we  obtain 


Note  that  4b  < d , with  equality  if  a = c.  Since  is  reset  to 
u^  at  t^  + t,  (6.l6)  reflects  the  total  error  in  v as  an  approxi- 
mation to  u at  tL  + t.  The  effect  of  intermittent  updating  is  to 
decrease  each  Fourier  coefficient  of  the  error  \ - u by  a factor 


2 2 2 1/2 

of  (l  - (kb‘  /d  ) sin  (7T5Td))'v  per  iteration  as  we  advance  in  time. 

This  factor  is  always  at  most  1.  Note  that  it  depends  on  |. 


In  particular,  the  f = 0 Fourier  coefficient  of  the  error  - u2 


remains  the  same.  Thus  g(0)  should  be  u2(0,0)  if  we  want 
l|v2  - u2H0  ->0  as  t -»  « . 

A perhaps  unexpected  result  of  this  is  that  malting  x smaller 
(i.e.  using  more  information  about  u^)  does  not  necessarily  make 


v0  approach  u0  faster  as  t increases;  it  may  make  things  worse. 

If  t is  close  to  0,  the  decrease  factor  will  be  close  to  1 for  small 
|||,  which  is  not  desirable.  Thus  in  intermittent  updating,  it  is  not 
always  best  to  "throw  in"  any  and  all  measured  data— even  if  it  is 
accurate.  The  way  the  process  works  requires  enough  time  between  up- 
dates of  v^  for  some  of  the  energy  of  the  error  to  pass  from  the 
second  to  the  first  component,  and  then  out  of  the  system  when  v^  is 
updated.  We  remark  that  the  problem  is  not  that  we  have  too  much  data, 
but  that  the  updating  process  does  not  use  the  data  to  its  best 
advantage.  However,  since  the  updating  process  is  not  as  costly  as 
methods  for  least  squares,  it  still  may  turn  out  to  be  the  most 
efficient  method,  provided  that  we  can  find  modifications  to  prevent 
slow  convergence  due  to  decrease  factors  close  to  1. 

We  applied  this  method  to  equation  (6.1)  in  two  test  cases. 

In  the  first,  the  initial  data  of  the  exact  solution  is  u^CxjO)  = 0 
and  u2(x,0)  = 1 - cos(27rx)j  only  | = 0 and  | = 1 occur.  We  used 

g(x)  h 1 as  our  initial  guess  for  u2(x,0)j  note  that  the  necessary 

/\ 

condition  g(0)  = u2(0,0)  is  satisfied.  The  numerical  integration 
was  performed  using  the  Lax-Wendroff  method* 
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(6.17)  u(x,  t + k) 

= u(x,t)  + ££  A(u(x  + h,t)  - u(x-h,t) ) 

2 

+ A2(u(x  + h,t)  - 2u(x,t)  + u(x-h,t) ) 

2h 

See  Kreiss  and  Oliger  (1975)  for  a discussion  of  this  difference  scheme. 
We  use  a one-step  method  to  avoid  the  difficulties  involved  in  trying 

to  update  a multi-step  method.  In  the  following,  the  observed  decrease 

2 

factor  (which  we  label  d.f.)  in  the  l norm  of  the  error  vg  - ug 
is  given  for  the  first  two  iterations.  The  initial  error  is  y/2/2  = .7 
T is  the  first  time  for  which  the  error  is  at  most  .1. 


T 

h 

k 

d.f. 

(l  - sin2(l07TT) ) 

T 

1 

125 

1 

20 

1 

125 

. 9686,  . 9683 

.9686 

00 

CO 

-4- 

• 

1 

25 

1 

20 

1 

125 

.3090,  .3009 

.3090 

.08 

1 

250 

1 

So 

1 

250 

.9921,  .9921 

.9921 

* 

1 

50 

1 

So 

1 

250 

.8090,  .8086 

.8086 

.20 

The  asterisk  indicates  that  at  t = .5,  the  error  is  still  .262. 

It  is  interesting  to  note  that  (.992l)^2^  • \/2/2  = *2^2,  80 
error  is  as  expected.  Observe  that  far  better  results  were  obtained 
with  a coarser  mesh,  but  with  a better  choice  for  t. 

In  the  second  test  case,  the  initial  data  of  the  exact 
solutionis  u^(x,0)  = 0 and  u2(x,0)  given  by  (6.12);  | =0,1,2, 
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and  3 occur.  We  used  g(x)  £ 1 as  our  initial  guess  for  u2(x,0); 
again,  this  gives  the  correct  £ = 0 Fourier  coefficient.  We  used 
the  Lax-Wendroff  method  (6.17)  with  h = l/k  and  k =l/250.  The 
initial  L error  of  v0  is  y/7  = 2.65. 

The  theoretical  decrease  factor  (l  - sin2(l07T|  r)  for 

£ = 1,2,  and  3 and  various  values  of  r are: 


H 

II 

ro 

** 

4k 

6k 

8k 

10k 

12k 

I = 1 

.9686 

.8763 

.7290 

.5358 

.3090 

.0628 

i = 2 

.8763 

.5358 

.0628 

.4258 

.8090 

.9921 

1 = 3 

.7290 

.0628 

• 6374 

• 9921 

.8090 

.1874 

Note  that 

this  factor  is  very  good  for 

1 = 1 

and  r = 12k, 

for 

1=2  and  t = < 

Sk,  and  for  | 

= 3 and 

t = 4k.  This  factor  is  very 

poor  for 

t = 2 

and  r = 12k 

and  for 

1 = 3 

and  t = 8k. 

The  l 

error  in  v? 

for  various  values  of  t and  t are: 

t = 2k 

4k 

6k 

8k 

10k 

12k 

t = .2k) 

.8151 

.2935 

.0934 

.7204 

.4265 

1.3814 

t = .480 

.3109 

.oki 

.0237 

.7621 

.1275 

1.3881 

t = .720 

.1189 

.0133 

.0211 

.7551 

.0567 

1.4001 

t = .960 

.0484 

.0117 

.0168 

.6kl 

.0523 

1.3853 

(Note:  for  r = 

8k,  the  values  of  t are  .256 

, .448,  .704, 

and  .960.' 

If(  r is 

2k,  4k, 

6k,  or  10k, 

the  method 

works 

well  for  this 

test  case 

For  t = 9k,  the  | = 3 Fourier  component  is  difficult  to  recover; 
for  t = 12k,  the  | = 2 Fourier  coefficient  is  difficult  to  recover. 
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We  see  that  for  equation  (3.1),  there  are  two  possible  reasons 


that  would  cause  the  error  in  vg  to  approach  a nonzero  asymptotic  value 
The  first  is  if  the  initial  guess  g(x)  for  u (x,0)  has  the  wrong  | = 0 


Fourier  coefficient.  The  second  is  if  the  frequency  of  updating  t 


happens  to  yield  a decrease  factor  which  is  very  close  to  1 for  some  | 


The  second  of  these  problems  might  be 


eliminated  if,  instead  of  restricting  ourselves  to  one  fixed  frequency 


of  updating,  we  use  several  different  frequencies  x 


we  update  v..  at 


and  continue  to  repeat  this  cycle.  The  gain  we  make  by  doing  this 


is  analogous  to  the  gain  we  make  by  measuring  u..  at  t = 0,t 


instead  of  just  at  t = 0 and  t1  in  the  previous  section.  If  one 


value  of  t yields  a poor  decrease  factor  for  a particular  §,  then 


hopefully  some  other  value  of  t in  the  set  x 


t will  yield 


a good  decrease  factor  for  this  |.  Repeating  the  cycle  ensures  that 


if  even  one  of  the  x . 's  yields  a reasonably  good  decrease  factor 


for  this  then  eventually  the  |-th  Fourier  coefficient  of  the  error 
in  will  be  small. 

Temperton  (1973)  suggested  that  using  different  frequencies  of 

updating  may  be  beneficial,  but  he  rejected  this  idea  because  his  test 

results  did  not  substantiate  it.  In  the  experiment  he  performed,  he 

used  r ^ = 12k,  = Ilk,  = 10k,  ...  , = 3k,  and  he  stopped  after 

the  tenth  iteration  and  did  not  repeat  the  cycle.  We  suggest  that  it 

may  be  better  to  keep  m relatively  small,  choose  t.'s  which  are  sub- 

J 

stantially  different,  and  to  repeat  the  cycle. 

We  ran  some  experiments  for  our  second  test  case  using  these 

2 

ideas,  and  the  following  are  the  results  for  the  4 errors  in  v0: 


CO 

II 

r—J 

t1  = 2k 

t2  = 4k 

t = 2k 

r2  = 4k 

Tj  = 6k 

t.  = 4k 

x2  = 4k 

Tj  = 6k 

eg 

II 

Tg  = 8k 

t r-  .240 

.4125 

. 1935 

.O84o 

.0551 

t = .480 

.0788 

.0274 

.0431 

.0347 

t = .720 

.0189 

.0232 

.0452 

.0377 

t = .960 

.0121 

.0194 

.0385 

.0334 

Notice  that  we 

have  convergence 

of  Vg  to 

u^  without  difficulty  in 

each  of  these  four  cases,  substantiating  our  suggestion  for  the  sample 
equation  (3.1). 
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